Data Engineer - Big Data

Farfetch (NYSE: FTCH)

  • Location
    • Porto
  • Date Posted
  • 18 Jan 2021
  • Function
  • Data Science
  • Sector
  • Retail

The role

We are looking for a person who will be part of our Business Intelligence & Analytics team, this position will be in charge of the development of high performance, distributed computing tasks using Big Data technologies such as Hadoop, NoSQL and other distributed environment technologies based on the needs of the organization. You will also be responsible for analyzing, designing, programming, debugging and modifying software enhancements and/or new products used in distributed, large scale analytics solutions.

What you'll do

  • Design scalable, end-to-end process to consume and integrate large volume, complex data from sources such as Hive, Flume, Kafka or Storm;
  • Provide Data Engineering expertise to multiple teams across our organization. Provide guidance to software engineers with industry and internal data best practices;
  • Build fault tolerant, adaptive and highly accurate data computational pipelines. Tune queries running over billion of rows of data running in a distributed query engine;
  • Research and implement new data technologies ;
  • Work with other teams to understand needs and provide solutions;
  • Work with the Business Intelligence development team on migration and improve existing SQL Server-based ETLs to Map Reduce and Hive (Cloud) technology to achieve scale and performance;
  • Help develop new processes on the data warehouse platform and work with Data Scientists to transform big data into model-­‐ ready forms to support projects.

Who you are

  • Experienced in working with large datasets (both structured and unstructured) using technologies such as MapReduce, Hadoop, HBase, Hive, Spark and NoSQL technologies;
  • Strong at programming background with languages such as Java, C++, or Python;
  • Knowledge in distributed systems;
  • A professional with a background in working in cloud environments – AWS, Rackspace, Azure;
  • Experienced with real-time analysis of sensor and other data from Internet of Things (IoTs) or other connected devices ;
  • Excellent in grasping of algorithmic concepts in computer science (e.g., sorting, data structures);
  • Experienced in the development and release of enterprise scale applications;
  • Experienced with version control.
  • Manage development of distributed computing tasks using Big Data technologies such as Hadoop, NoSQL and other distributed environment technologies based on our needs.