Data Engineer

Location: San Francisco

About Plasmidsaurus

Plasmidsaurus is on a mission to accelerate new cures and promote a healthier planet by unlocking a new level of productivity for scientists who use DNA tools to bring their ideas to life. Our DNA sequencing tools are used daily by thousands of innovators, including Nobel prize winners, dynamic biotech startups, research labs, and DIY biohackers. Our global network of labs operates day and night to enable world-changing discoveries. In 2024, we’re going to save these scientists 2 million hours of time, radically accelerating their research. Every team member at Plasmidsaurus plays a crucial role in driving forward the future of biotech research.

Position Overview

As a Senior Data Engineer, you would have ownership of data pipelines that shuttle terabytes of raw sequencing data between the cloud and our sequencing devices, launch our bioinformatics algorithms, and manage the overall communication between our microservices and sequencers. Our customers rely on their overnight sequencing results to conduct world-changing research, so building a robust data architecture is critical. In this position, you will:

  • Design, build, and maintain a robust data architecture to shuttle raw sequencing data between our on-prem sequencers and our cloud infrastructure
  • Deploy new pipelines with infrastructure-as-code best practices
  • Build microservices to coordinate decisions across a global fleet of sequencers
  • Develop a robust architecture for bioinformatics task orchestration
  • Collaborate with a diverse team of scientists and engineers

Culture

You are someone who:

  • Enjoys high agency and high ownership work
  • Is curious about learning from other people’s expertise in scientific/biological domains
  • Is excited to make an impact on the scientific community, enabling the next generation of medicines, biotechnology, and even plant-based-meats!
  • Can lead projects from idea to production independently
  • Is a life-long learner
  • Is excited to work in a tight-knit fast-paced environment with a motivated team
  • Values clear communication and helping other learn new skills

Qualifications

  • BS, MS, or PhD in Computer Science, or relevant work experience
  • 5+ years of industry experience
  • Fluent in Python and SQL
  • Experience with data modeling, ETL, and data warehousing
  • Experience working with AWS and deploying resources with infrastructure-as-code best practice
  • Experience building cloud pipelines, containerized applications (docker), and container orchestration (ECS, kubernetes, etc.),
  • Experience with Linux environments and version control (git)
  • (Preferred) experience with best practices implementing network security in on-prem and virtual private clouds
  • (Preferred) experience working in a startup environment

Why Plasmidsaurus?

  • IMPACT: Your work will directly contribute to accelerating biotech research. This research is one of humanity’s most powerful tools for stopping climate change and developing novel therapies.
  • INNOVATION: Work on the cutting edge of biotech and software, introducing indispensable tools to top-tier researchers.
  • COMMUNITY: Join a passionate, scrappy team that values close, interactive relationships with our customers and each other.

Please apply on LinkedIn .