/

Data Engineer

--Remote--

We are looking for a Senior Data Engineer to work on a variety of data projects in a small, fully remote company. If you have extensive experience as a Python developer and you'd like to grow as a Data Engineer this is a perfect place for you!

 

Prophecy Labs is a Data Project House with HQ in Belgium. We are a  team of passionate Data Scientists, Data Engineers and Software Engineers from around the globe. We do end-to-end projects in the Data Science field focusing on challenging problems that provide value.

What you will be doing?

  • Be a part of an agile and enthusiastic team.

  • Work on cutting edge solutions for a variety of projects.

  • Build, maintain, scale, automate, and improve data pipelines related features on our products.

  • Enhance ETL and data science pipelines to work in a fully automated way.

  • Work together with data scientists and developers to automate and put in production data science use-cases.

Must have:

  • Proficiency in English - absolutely mandatory, we’re an English speaking team.

  • At least 2 years working as a Data Engineer or related field.

Job requirements:

We know that this position requires experience both as a Python Developer and Data Engineer, so it would be good if you fulfil at least 5 of them.

  • Experience with working in a production data environment.

  • Experience with CDI/Orchestration/Scheduling/ETL software such as Jenkins/Ansible/Airflow/Nifi.

  • Experience with version control.

  • Experience with REST APIs.

  • Knowledge of optimization of Machine Learning models to work at scale.

  • Experience the end to end big data process.

  • Experience with Google Cloud/AWS.

  • Experience with Python Data Science Stack (pandas, scikit learn, keras, numpy, tensor flow).

  • Experience with Machine Learning.

  • Masters degree in computer science or other related fields (physics, mathematics, engineering, and bioinformatics.


Tools you will work with:

  • Python (Data Science stack (pandas, pytorch, sklearn, tensorflow etc. + Flask/Django)

  • Jenkins

  • Airlflow

  • Nifi

  • Spark

  • SQL