This position is no longer open as of 2023 October. Please see /join for a list of open positions.

A computational biology research group in the Center for Public Heath Genomics at the University of Virginia led by Nathan Sheffield is recruiting a programmer with skills in Python and/or R.

About the lab

The Databio group (http://www.databio.org) is an interdisciplinary and collaborative computational biology research group located in the Center for Public Health Genomics at UVA. We are also affiliated with the Department of Biomedical Engineering, Department of Biochemistry and Molecular Genetics, Department of Public Health Sciences, the School of Data Science, the Cancer Center, and the Child Health Research Center at UVA.

Our research is at the interface of computation and biology, drawing on techniques in computer science, data science, bioinformatics, and machine learning, and applying them to biological questions in cancer, epigenetics, single-cell analysis, development, and genomics. We collect both novel data and public data and make use of UVA’s high-performance cluster for computational approaches to biological questions.

Our biological questions are focused on understanding gene regulation and epigenetics in development and disease, such as cancer, atherosclerosis, and kidney disease. How does DNA encode regulatory networks that enable cellular differentiation? We rely on experimental data from sequencing-based epigenome experiments like ATAC-seq, bisulfite-seq, and ChIP-seq, and we use these data to study fundamental principles of regulatory DNA in human health.

We are building a team of intelligent, creative people who are interested in working together to accomplish great things. We collaborate extensively. We emphasize social coding, using GitHub to share code both within the group and so others can benefit from our work. We seek to write readable, reusable code and apply it to new biological questions. We challenge the norm in academic computational research of individual scientists writing isolated code, and instead push open, multi-author code development. If these topics excite you, please read more about our research interests, recent publications, and philosophy of open data.

About the university

The University of Virginia, founded by Thomas Jefferson in 1819, is the only collegiate UNESCO World Heritage site in the US. It located near Shenandoah National Park in the beautiful foothills of the Appalachian Mountains in Charlottesville, Virginia. UVA routinely ranks in the top 3 public universities in the United States, with a co-located School of Medicine and a top-tier environment for research. Charlottesville is also a great place to live. If you’re unfamiliar with area, please read the many reasons you should consider coming to UVA.

Job description

Our tech stack includes Python, R, C++, PostgreSQL, Docker, AWS, Azure, React, Bioconductor, FastAPI, and NodeJS. We are both building new software, and using it to process biological data, and the successful applicant will have opportunies in both areas. The scientific programmer will have two primary responsibilities:

  1. Contributing to high quality bioinformatics software. The programmer will be responsible for improving methods developed within our group, such as generalizing them to build robust software that is useful beyond the initial project. This will include code review and will potentially include building web-accessible user interfaces to increase outside access.

  2. Processing large biological data sets through computational pipelines. This will require curating raw input data, documenting data in biological projects, configuring software, and using local supercomputing resources (SLURM) to process the data. This will include monitoring and improving pipeline performance and developing reports to describe data quality and summarize results. The programmer will curate hundreds of terabytes of raw and processed disk space and work with professional systems administrators.

Interested in joining?

For this position, you should have some interest in either building bioinformatics applications or analyzing large biomedical datasets.

Qualifications:

  • Computational experience with scripting in R or Python is required.
  • Experience with git and GitHub is required
  • Skill with C++ and object-oriented programming is preferred but not required.
  • Knowledge of biology would be a plus, but not required – however, you must be willing to work with biological data and communicate with biologists.
  • Most important is a willingness to contribute to team projects (such as building communal software) and demonstrated commitment to sharing software and data with the community.
  • Candidates at all levels are acceptable; your experience level (undergraduate, bachelors degree, or masters degree) is less important than your coding skills, work ethic, and team spirit.

Include with your application:

  • Your education and publication history, if any
  • Your career goals, and what you think you would gain from experience in our group
  • Why you think you would fit well, and what you would contribute
  • Your biggest ideas for what projects or types of projects you find exciting
  • A link to GitHub or other public repository with code you showcase, if any

The University of Virginia is an equal opportunity and affirmative action employer. Applications will be reviewed on a rolling basis and the position will remain open until filled.

Informal inquiries and application materials can be directed to the attention of Dr. Nathan Sheffield at . But ultimately, official applications must be submitted at the official UVA application system.