Here at BenchSci, we accelerate drug discovery by using machine learning to facilitate successful experiments. We’re backed by Gradient Ventures, Google’s AI fund, and built by life scientists for life scientists. We’ve become the world leader in AI-assisted antibody selection, and we’re growing!
We are currently seeking a SeniorData Engineer/Tech Lead to join our Data Team. As part of the job, you will work on evolving our data models in several styles of datastores, improve internal tooling to allow data self-service, and operationalize production-grade data pipelines.
What you’ll do:
Collaborate with life scientists and machine learning engineers on how to capture and model additional scientific experiments
Create tooling for low-friction data movement between Neo4J, SQL and Spark
Develop frameworks to detect model drift, recalibrate, and redeploy them to production seamlessly
Extend the data pipeline from using semi-structured xml to also capture unstructured text
Collaborate closely with other engineers to solve interesting and challenging data problems
Who we’re looking for:
5+ years working as a professional developer
Experience with SQL
Experience with cloud reference architectures and developing specialized stacks on cloud services
Expertise in Spark 2.x, Dataset/DataFrame API and performance tuning
Experience with R or Pandas
You have strong cross-team communication and collaboration skills
A team player who strives to see teammates succeed together