Magnet.me  -  Het slimme netwerk waarop hbo‑ en wo‑studenten hun baan of stage vinden.

Het slimme netwerk waarop hbo‑ en wo‑studenten hun baan of stage vinden.

Deze vacature is verlopen. Je kunt daarom niet meer liken of solliciteren.

Vergelijkbare vacatures bekijken

Big Data Engineer - Python, Spark and SQL

Geplaatst 26 mrt. 2024
Werkervaring
1 tot 3 jaar
Full-time / part-time
Full-time
Functie
Soort opleiding
Taalvereisten
Engels (Vloeiend)
Nederlands (Vloeiend)

Je carrière begint op Magnet.me

Maak een profiel aan en ontvang slimme aanbevelingen op basis van je gelikete vacatures.

ABOUT THE JOB

We are looking for self-motivated, creative thinkers, people that are flexible and enjoy working in teams. Our data engineering / ETL team is responsible for the development of daily ETL processes in which large amounts of behavioral data from consumer panels are imported in our data lake. The data is used by our software to provide insights into consumer behavior. The ETL processes need to evolve in order to deal with the increase in the size and complexity of the data and to cope with higher requirements with respect to data quality and throughput time. Our data engineering team is pragmatic and keen to apply the best tools for the job. We have wide experience with distributed systems such as Hadoop and Hive, in addition to in-memory distributed computation platforms like Spark. And we develop everything on Linux locally, manage the source code in Git, and run our workflows on AWS in the cloud.

The team has an open culture, works in an agile style and in close cooperation with software developers and colleagues from other disciplines, such as data scientists and client-facing solution managers. You will have the opportunity to develop yourself in areas like big data, cloud computing, data lake architecture and data orchestration.

YOUR PROFILE

  • Bachelor or master’s degree in Computer Science or proven experience in the field;
  • Knowledge of and experience with data engineering best practices is crucial, including the ability to work quickly and independently, communicate well, and furthermore knowing how to devise working solutions, which also generalize towards future workloads.
  • The ability to deal with (big) data workloads is considered a given, like how to deal with daily data quality issues, big data scaling issues (like performance issues and memory constraint issues), etc. Advanced knowledge of Python, SQL and Apache Spark;
  • Knowledge of and experience with Hadoop/HDFS, Hive, PySpark and SparkSQL. Experience with developing in PyCharm is a plus;
  • Knowledge of and experience with AWS (especially S3 and EMR). Experience with working on Linux (Ubuntu, Bash) is a plus;
  • Knowledge of and experience with Git for source code version-control. Experience with GitHub or Gitlab is a plus.
  • Knowledge of and experience with FTP ingestion and data lakes / data warehousing.
  • Knowledge of and experience with column-oriented data storage formats (ORC, Parquet) or Presto/Athena is a plus;
  • Knowledge of and experience with Docker (Compose). Knowledge of and experience with Kubernetes or Airflow is a plus;
  • Team player with flexible, proactive and pragmatic attitude;
  • Preferably residing in the Netherlands. Otherwise willing to relocate to Rotterdam

WHAT WE OFFER

  • Competitive salary and benefits;
  • Personal and professional development opportunities;
  • Flexibility in working hours and location;
  • Exciting development projects and clients;
  • An open, respectful and multicultural atmosphere;
  • Time for socialising and fun;
  • In the office:
  • A Football and a Ping-pong table, and Friday afternoon drinks (every Friday)
  • Daily fruit snacks
  • Reimbursement of traveling expenses
  • Working from home:
  • Weekly team stand ups and frequent online project meet ups
  • Regular online team coffee breaks and events
  • Support with home office equipment
  • 25 days of paid leave;

Nielsen N.V. (NYSE: NLSN) is a global performance management company that provides a comprehensive understanding of what consumers Watch and Buy. Nielsen’s Watch segment provides media and advertising clients with Total Audience measurement services across all devices where content — video, audio and text — is consumed. The Buy segment offers consumer packaged goods manufacturers and retailers the industry’s only…


Nielsen N.V. (NYSE: NLSN) is a global performance management company that provides a comprehensive understanding of what consumers Watch and Buy. Nielsen’s Watch segment provides media and advertising clients with Total Audience measurement services across all devices where content — video, audio and text — is consumed. The Buy segment offers consumer packaged goods manufacturers and retailers the industry’s only global view of retail performance measurement.

By integrating information from its Watch and Buy segments and other data sources, Nielsen provides its clients with both world-class measurement as well as analytics that help improve performance. Nielsen, an S&P 500 company, has operations in over 100 countries that cover more than 90 percent of the world’s population.

For more information, visit www.nielsen.com.

Media
Diemen
10.001 medewerkers