Research Scientist, Generative AI Evaluations, Health AI
Company: Google
Location: Mountain View
Posted on: July 1, 2025
|
|
Job Description:
Minimum qualifications: PhD degree in Computer Science, a
related field, or equivalent practical experience. One or more
scientific publication submission(s) for conferences (eg,
CVPR/ECCV/ICCV/NeurIPS/ICLR), journals, or public repositories
Experience in AI/ML research and Generative AI agent development.
Preferred qualifications: 4 years of experience in AI/ML research
or agent development. Experience with human-in-the-loop evaluation
methodologies, including designing annotation/rating tasks and
managing data quality. Understanding of current AI agent
architectures (e.g., LLM-based agents, reinforcement learning
agents) and their evaluation issues. Ability to have a proven track
record of leading evaluation efforts for complex AI systems,
particularly AI agents or interactive AI. Familiarity with the
unique tests of evaluating AI in health-related applications, such
as safety, need for clinical validation etc. Excellent programming
skills with Python. About the job As an organization, Google
maintains a portfolio of research projects driven by fundamental
research, new product innovation, product contribution and
infrastructure goals, while providing individuals and teams the
freedom to emphasize specific types of work. As a Research
Scientist, youll setup large-scale tests and deploy promising ideas
quickly and broadly, managing deadlines and deliverables while
applying the latest theories to develop new and improved products,
processes, or technologies. From creating experiments and
prototyping implementations to designing new architectures, our
research scientists work on real-world problems that span the
breadth of computer science, such as machine (and deep) learning,
data mining, natural language processing, hardware and software
performance analysis, improving compilers for mobile platforms, as
well as core search and much more. As a Research Scientist, youll
also actively contribute to the wider research community by sharing
and publishing your findings, with ideas inspired by internal
projects as well as from collaborations with research programs at
partner universities and technical institutes all over the world.
Our mission is to help people in understanding and improving their
health. Our team works towards this goal by developing innovative
solutions like a personal health agent and new sensing and
intelligence features for health and home across Google surfaces.
Google Health is a company-wide effort to help billions of people
be healthier. We work toward this vision by meeting people in their
everyday moments and empowering them to stay healthy and partnering
with care teams to provide more accurate and accessible care. Our
teams are applying our expertise and technology to improve health
outcomes globally – with high-quality information and tools to help
people manage their health and wellbeing, solutions to transform
care delivery, research to catalyze the use of artificial
intelligence for the screening and diagnosis of disease, and data
and insights to the public health community. The US base salary
range for this full-time position is $141,000-$202,000 bonus equity
benefits. Our salary ranges are determined by role, level, and
location. Within the range, individual pay is determined by work
location and additional factors, including job-related skills,
experience, and relevant education or training. Your recruiter can
share more about the specific salary range for your preferred
location during the hiring process. Please note that the
compensation details listed in US role postings reflect the base
salary only, and do not include bonus, equity, or benefits. Learn
more about benefits at Google . Responsibilities Ensure robust and
iterative evaluation of health features, establishing methods for
continuous monitoring, feedback loops, and adaptation to new data
or clinical insights. Collaborate with cross-functional teams
(research, engineering, product, clinical, legal, ethics) to define
evaluation requirements, interpret results, and drive improvements
in AI models and health applications. Conduct in-depth analysis of
AI agent behavior, identify failure modes, and provide actionable
insights to improve model performance and safety. Stay at the
forefront of research in AI evaluation, particularly in areas of
agent-based systems, human-AI interaction, and the specific issues
of evaluating AI in consumer health and wellness. Develop and
advocate best practices, tools, and documentation for AI agent
evaluation across health-related projects.
Keywords: Google, Cupertino , Research Scientist, Generative AI Evaluations, Health AI, IT / Software / Systems , Mountain View, California