About the company As part of the methods development team in the Translational Genomics Group, you will have the opportunity to make substantial contributions to high-impact projects with direct implications for clinical practice, as well as to participate in the vibrant research environment at the Broad, with its close links to MIT, Harvard, and the Harvard-affiliated hospitals across Boston. You will have access to data sets of extraordinary scale and to colleagues with deep expertise in genetics, computational biology, software development, and machine learning. The responsibilities of this role align closely with the mission of the Broad to transform medicine and human health through cross-disciplinary collaboration and the development of pioneering technologies to analyze scientific data on an unprecedented scale.
We are an Agile team running production and development in a Scrum framework, and we care deeply about managing our work well, maintaining healthy work-life boundaries, and investing in the professional growth of our team members. You will have access to Broadโs thoughtful and well-resourced leadership development and management training programs, in addition to a generous vacation policy and benefits package. We operated on a hybrid remote/in-office work schedule even before the pandemic, and we expect to continue this model moving forward and are able to accommodate any candidates living within the New England region.
Job description
Since 2016, the Genome Aggregation Database (gnomAD) has been a pioneer in human genomic data aggregation through the regular public release of data for a rapidly growing collection of exomes and genomes sampled from diverse populations across the globe. gnomAD is the default resource used in virtually every clinical variant interpretation pipeline today, and our browser has generated over 39 million page views to date, with tens of thousands of regular monthly users.
We are seeking a creative, self-motivated candidate at the PhD level to play a critical role in designing and developing fast, automated, open-source computational pipelines to produce high-quality public data releases for forthcoming โ and exponentially growing โ datasets in gnomAD. The role will involve close collaboration with scientists across the Broad to develop novel approaches for quality control and analysis of our highly heterogeneous datasets at exceptional scale, as well as the eventual supervision of associate computational scientists in the group who will be assigned to work alongside the candidate. The candidate will also have the opportunity to interact closely with Hail developers at the Broad to play a role in the feature design of the fieldโs most cutting-edge toolkit for massively parallel, high-throughput computation of genetic data. As this role involves collaboration with a wide variety of staff across disciplines, including computational scientists, academic trainees, software engineers, biologists, and clinical geneticists, we are specifically looking for a candidate who works well in teams.
Growing a strong team with a diversity of life experiences and backgrounds, who foster a culture of continual learning and who support the growth and success of one another, is key to our success. We are therefore committed to seeking applications from women and from underrepresented groups.
Career development opportunities for this role:
Supervising and mentoring associate computational staff scientist(s) assigned to work on gnomAD releases, including weekly check-ins, quarterly performance reviews, and discussions on career development. Management training and hands-on mentorship in this area will be provided, depending on candidateโs previous experience managing others
Setting concrete objectives and tasks, professional standards, and expectations for associate staff scientist(s); helping them to prioritize tasks, troubleshoot technical issues, locate resources (people and tools), and manage relationships with collaborators
Handling general supervisory/HR administrative tasks for associate staff, including approving vacations and expense reports, writing annual performance reviews, and making recommendations for promotions and salary increases
Characteristics and Qualifications:
The role will require an independent and highly motivated candidate with the ambition to maintain and develop a significant and sophisticated body of code that is used on a regular basis to produce large, public data releases with a highly active and invested user community.
You will have domain expertise in computational methods for analyzing next-generation sequencing data, as well as an interest in the technical aspects of deploying these methods at scale.
We are looking for someone who:
Is able to write clean, efficient, robust, and usable code, with demonstrated proficiency in one of the following: Unix/Linux, Python, Java, C++, Matlab, or R, with a strong preference for Python, Unix/Linux, and R
Has a Ph.D. in mathematics, computer science, engineering, physics, mathematics, statistics, biology, or another related field; or equivalent professional experience
Has demonstrated experience in quantitative (statistical, mathematical, computational) research with large data sets; skill and experience with statistical analysis and/or computational biology is strongly preferred โ with special consideration for individuals with prior experience using the Hail Python library
Has fluency with human genetics and next-generation sequencing data; ideally will have prior experience with the quality control of such datasets
Exhibits strong initiative and the ability to take ownership of complex projects and interest in the management and development of a team
Cares passionately about the quality of his/her work and demonstrates zealous attention to detail; is curious and tenacious about investigating anomalies in data
Is familiar with Git and modern team-based software development practices, including peer code review through pull requests
Listens, communicates, and collaborates well with team members, clinicians, software developers, and research scientists; is receptive to feedback and willing to provide constructive feedback to others; demonstrates kindness to others
Demonstrates excellent written and oral presentation skills
Manages time well and is able to respond to shifting priorities in a fast-paced and rapidly changing environment
Instructions how to apply Apply at this url: https://Jobhunt.ai/machinelearning-ml-ai-job-we-Computational-Scientist-I-Genome-Aggregation-Database-Cambridge-Broad-Institute-of-Harvard-and-MIT.html Please mention that you found the job at Jobhunt.ai
[ job website ]
Other machine learning jobs that might be interesting
Software Engineer, Machine Learning - Reverie Labs(May 2022) Cambridge, Massachusetts, United States
At Reverie Labs, weโre building a pharmaceutical company from the ground up using computationโweโre a drug company that looks and feels like a tech company. Weโre a team of engineers and machi...Senior/Staff Machine Learning Engineer - BrightHire(April 2022) Remote US, 100% Remote
Our mission is to fix hiring, making it fundamentally more equitable, effective, collaborative, and human. Weโre doing this by building the first intelligent platform for the most important part of ...
Senior Deep Learning Researcher (Speech Recognition) - AssemblyAI Worldwide, 100% Remote - Salary: $140,000 - 225,000
AssemblyAI is an AI company - we build powerful models to transcribe and understand audio data, exposed through simple APIs.
Hundreds of companies, and thousands of developers, use our APIs to both ...
Senior Machine Learning Engineer - PathAI(April 2022) Boston, Massachusetts, United States (Remote work possible) Boston, MA or Remote
PathAI's mission is to improve patient outcomes with AI-powered pathology. Our platform promises substantial improvements to the accuracy of diagnosis and the efficacy of treatme...Senior Machine Learning Engineer - Cohere Health(April 2022) Boston, Massachusetts, United States (Remote work possible) In this role, youโll join our growing team of world-class engineers, statisticians, and clinical experts to deploy machine learning algorithms that help automate burdensome administrative clinical p...