München, DE

Data Engineer (m/f/d)

Main Duties and Responsibilities

  • Help us create AI/ML-ready datasets from petabytes of raw data and metadata
  • Automate the integration of different data sources into coherent data flows and pipelines (including support for data normalization and result calculation)
  • Develop and build systems and architectures for ETLs
  • Perform system & data testing
  • Understand and apply FAIR data principles
  • Adhere strictly to compliance and regulatory requirements
  • Build algorithms to

Essential Requirements

  • Master's-level degree in Computer Science, Engineering, or Bioinformatics, plus 5 years of relevant experience
  • Excellent programming skills (Python, C++, R)
  • Experience in designing and implementing RESTful APIs and web services
  • Ability to interact with various data sources, both structured and unstructured (e.g. HDFS, SQL, NoSQL)
  • Experience working across multiple scientific compute environments to create data workflows and pipelines (e.g. HPC, cloud, Unix/Linux systems)


  • Expertise with biological/health data
  • Experience modelling data and information for graph/network representation
  • Experience working with metadata models, controlled vocabularies and ontologies
  • Ability to understand, map, integrate, and document complex data relationships and business rules
  • Familiarity with data quality, cleaning and masking techniques
  • Knowledge of modern frameworks and concepts for scalable and distributed computation (containerization and orchestration, e.g. k8s; specialized frameworks such as Spark, Hadoop, …)
  • Experience with image processing and computer graphics
  • Experience with cloud computing

We are looking forward to your application!

About Definiens

Are you passionate about advanced image analysis? Can you apply these skills to help us in drug and diagnostic biomarker discovery? Do you have expertise in applying sophisticated algorithms and image analysis techniques to complex imaging data, and a thirst for using these skills to advance the boundaries of science and deliver life-changing medicines to patients? Join Definiens, the AstraZeneca Translational Medicine Team in Munich!

As our new Image Data Scientist, you will apply cutting-edge technologies and advanced image processing and analysis to help derive biological insights from biological tissue in support of drug and biomarker development projects. You will work alongside our team of pathologists, translational data scientists and software developers within Definiens.

Definiens is changing the way imaging technologies are combined with other omics technologies and clinical patient data to offer unprecedented insight and impact. We empower translational science to transform cancer treatment with actionable information derived from biomedical images by innovating at the interface between artificial intelligence, pathology and data sciences.