Data Scientist / Sr. Data Scientist

Location: San Francisco Bay Area (San Carlos, CA) or Singapore

Full Time

Engine Biosciences is a venture-backed biotechnology company discovering and developing novel therapeutics and precision medicines, using a proprietary platform that integrates massively parallel biological experimentation with data science, machine learning and AI. Led by scientific experts from MIT, Harvard, Mayo Clinic and UCSD, along with successful drug developers, informaticians and company builders, Engine is working on multiple programs and therapeutic areas and is growing rapidly across the US and Asia.

We are seeking a Data Scientist or Senior Data Scientist with strong expertise in machine learning, statistics or computational biology to design data infrastructure and predictive models for accelerated target and biomarker discovery. You will integrate, model and interpret a diverse set of biological data, including genomics data, pathway and interaction data, and text-mining data from internal and external sources, to drive scientific insight. You will have a track record of deriving insights from heterogeneous sources of biological data through machine learning approaches. You will also bring hands-on experience in algorithm development, data science, machine learning and cloud computing, ideally applied to target discovery. Collaboration is essential to this role: you will work globally with Engine’s teams and external partners in Asia, the US and the EU.

Minimum Requirements

  • Ph.D. in Computational Biology / Statistics / Machine Learning / Computer Science, or a related field.

  • Expertise in a comprehensive set of data analytics and modeling methods, including machine learning and deep learning approaches, and a deep understanding of the underlying theory and algorithms

  • Familiarity with multi-dimensional omics data and biochemical experimental data and their analysis

  • Experience in cloud-based (e.g. AWS) data science. Familiarity with SageMaker is an advantage.

  • Programming skills in at least two of: Python, AngularJS, Node.js, Java, R, MATLAB, SQL and NoSQL databases.

  • Familiarity with DevOps and good software practices (e.g. version control, continuous integration, pipeline development and deployment)

  • Strong communication and global collaboration skills.

Desired Skills & Experience

  • 2+ years of hands-on industry experience in algorithm development and machine learning applied to drug discovery and biomarker identification

  • Experience in natural language processing approaches

  • Strong peer-reviewed publication record in data science, statistics, bioinformatics or machine learning.

  • Experience in communicating biological interpretation to the science team and executive leadership

  • Experience in identifying external datasets to support project needs

  • Amazon Web Services (AWS) experience, including DevOps, Security, VPC, EC2, EMR, Docker, Spark, Elasticsearch, Lambda, Redshift, and Amazon Machine Learning (TensorFlow, MXNet, SageMaker).

  • Experience working across multiple time zones with executive leadership, scientists and engineers.

  • Candidates must hold a valid working visa, residency or citizenship of the US or Singapore.

Apply Now

To apply, please e-mail your CV to