2023–2024
Staff Data Scientist, Lead, Core ML Data Science
- Assessed data quality at scale, measured rater performance, and automated the cleaning of anomalies from
Google's training data for large language models company-wide.
-
Led the development of a web app using a PaLM 2 language model to
assist in the construction of complex SQL queries over a large data
repository.
-
Found previously undetected hits in a large-scale antimalarial drug
screen via a novel application of machine learning to microscopy
images of treated parasites.
-
Devised and implemented an approach to counteracting batch effects
in a cellular assay using optimal transport over Gaussian mixture
models; code contributed to Optimal Transport Tools.
-
Tech lead responsible for the development and implementation of
Verily's pipelines for QA and processing of RNA-Seq data.
-
Devised and implemented a novel algorithm for estimating levels of
DNA contamination in RNA-Seq data.
2012–2014
Staff Data Scientist, Search Infrastructure
-
Gave regular presentations to Senior Vice Presidents on the
transition of Google's search users from desktop to mobile.
-
Created long-term forecasts for global query volume to guide
decisions on the placement of future data centers.
-
Analyzed large datasets to provide insights that influenced AdWords
and News product team decisions.
-
Established a set of AdWords user metrics and developed the
necessary infrastructure to gather data from diverse sources.
-
Introduced a novel online help format for AdWords, secured resources
for development, and showcased its effectiveness. The format was
adopted by Gmail, AdSense, and Webmaster Tools.
-
Won an internal business competition with an ad format tailored for
an important niche market, guided its realization, and filed a
related patent.
Collected Insight
San Francisco, CA; Raleigh, NC
-
Core developer for Plone, an open-source content management
system. Member of the board of directors of the Plone Foundation,
2004–2006.
-
Created one of the leading graduate school guides based on data from
government sources. Written in Ruby on Rails.
-
Principal investigator for the Sigma Xi Postdoctoral Survey project,
a national study of young scientists. Raised funds via grants from
the Alfred P. Sloan Foundation and the Burroughs Wellcome Fund.
- Continued part-time 2007–2013.
-
Researched methods for the efficient storage and transmission of
digital multimedia.
1998–1999
Researcher / Software Engineer, Semantic Platform Group
-
Developed algorithms and software for analyzing and manipulating
semantically annotated data.
Corporate Communication Group
New York, NY
1992
Senior Programmer, Interactive Technologies
-
Designed and implemented an unencumbered, third-person virtual
reality software platform in C++. Developed software for optical
tracking of hand position, a sprite control object library, control
software for video and audio hardware, and communications software
to enable remote virtual interaction. Gold medalist in the New Media
Magazine 1993 Multimedia Awards, gold medalist at the 1992 New York
Festivals, and silver medalist at the 1992 Association of Visual
Communicators' CINDY awards.
Patent