Data Engineering (healthcare)
Pacmed is hard op zoek naar stagiairs en/of werkstudenten voor het Data Engineering team. Onze Data Engineers zorgen dat de machine learning modellen die wij bouwen ook daadwerkelijk in de productie software geïmplementeerd worden.
What will you do?
Data Engineers at Pacmed will help make sure that Machine Learning models add value to everyday life in Hospitals and General Practices. They provide the Data Science team with the proper environment to develop models by (for example) building a Data Lake or setting up a Data Warehouse. When models are ready to be used in production, they will take care of the implementation in different kinds of environments.
Important aspects of the Data Engineer job include:
- Ensure an end-user (typically a doctor) will have access to Machine Learning predictions models by building software solutions that either run in a cloud platform, or on the clients private servers - Build data lake and data warehouse solutions to streamline the development and training / testing of our Machine Learning models - Ensure that running models are properly monitored - Ensure that data and predictions provided by running models are properly ingested back into our data-solutions - Help to choose between an Object-Oriented, Functional Programming or Event-sourcing approach for different projects
Interns or student employees will assist the Data Engineers at Pacmed with one or more of such projects. As an intern there is also the opportunity to write your Master's thesis at Pacmed. In this case we will choose an appropriate project together with you and your supervisor.
What do you get?
A lot of opportunity for personal development (e.g. development Fridays, soft skills trainings, budget for online courses, visiting conferences).
You get the opportunity to learn a lot from experienced Data Engineers and Data Scientists.
You work together within a vibrant community of experienced data scientists & engineers and leading technical and medical academics.
You work in an environment where we continuously ensure that our software is of production-level quality through Merge Requests, code review and unit- and integration testing.
You work closely together in an enthusiastic and ambitious team, dedicated to make health care smarter and better.
A dynamic working environment where fun at work is very important.
An office at the FreedomLab campus, including unlimited coffee & snacks, Friday drinks, ping-pong competitions and a great location in the heart of Amsterdam
What are we looking ofr?
- Master’s degree in Computer Science or similar study (or currently studying) . - Programming experience in Python (and knowledge on object-oriented software design) - Basic knowledge of and interest in Machine Learning
- Experience with distributed systems, preferably with Hadoop and Apache Spark - Experience with streaming data, preferably in Apache Kafka - Experience with NoSQL storage like ElasticSearch, Cassandra or HBase - Experience with building Data Lakes and Extract-Transform-Load (ETL) pipelines - Programming experience with Scala or Java - Experience with Docker / Kubernetes - Understanding and speaking Dutch is preferred but not required