Junior Data Engineer, hibrido
Empresa
BNP Paribas
Provincia
Madrid
Ciudad
Madrid
Tipo de Contrato
Tiempo Completo
Descripción
Junior Data Engineer
About the job
Junior Data Engineer South Europe Technologies (S.ET), BNP Paribas Personal Finance
South Europe Technologies (S.ET) is one of BNP Paribas Personal Finance shared services centers delivering the best IT Solutions to BNP Paribas Personal Finance entities around the world:
- Applications Management (Architecture, Project management, Development, and Quality Assurance)
- IT Risks Cybersecurity services
- Platforms management
- Data
- Ad-hoc, T M development
In this context, we are looking for a Data Engineer.
As a Data Engineer, your mission is to design, implement, and optimize robust data pipelines and infrastructure, enabling reliable, secure, and high-performance data flows throughout the organization. You will work closely with stakeholders and multidisciplinary teams to support data integration, transformation, and delivery processes, contributing to the ongoing evolution and stability of our data platforms.
Your main activities are to:
- Implement and maintain orchestrators and scheduling systems to automate data pipeline execution (e.g., Airflow as a service).
- Modify and enhance existing codebases in line with business requirements, continuously driving improvements in performance and maintainability.
- Monitor, ensure, andoptimizethe performance and security of the data infrastructure, applying best practices in Data Engineering.
- Contribute to production support, incident resolution, and anomaly correction, as well as support functional and technical evolutions to ensure process stability.
- Develop andmaintaincomprehensive technical documentation to ensure effective knowledge capitalization.
- Assistin building andmaintainingdata pipelines using Spark on Scala for collecting and processing data from diverse sources such as Kafka topics, APIs, HDFS, and structured databases.
- Support data transformation activities and contribute to data quality assurance, ensuring the reliability and accuracy of information.
- Help set up CI/CD pipelines under the guidance of senior team members to automate testing and deployment.
- Learn and employ orchestration tools like Airflow for scheduling and automating data workflows.
- Make incremental improvements to code and contribute to performance enhancements asrequired, aligned with business needs.
- Participate in monitoring data infrastructure for performance and security, learning and applying industry best practices.
- Assistwith production support tasks, including incident identification and resolution, and support ongoing technical improvements.
- Document and update technical processes to ensure clear records of changes and procedures.
IT Tools
- Good knowledge of
- Spark on Scala
- CI/CD tools (Gitlab, Jenkins...)
- HDFS and structured databases (SQL)
- Full understanding of
- Apache Airflow
- Streaming process (Kafka, event steam...)
- S3 storage
- Shell script
- Some knowledge of
- Kubernetes
- Optionally/ as a plus
- Elasticsearch and Kibana
- HVault
- Dremioas tool to virtualize data
- Dataiku
What we are looking for
- Demonstrated knowledge of the banking sector and related business processes
- Experience in managing business and IT relationships
- Ability to understand, explain, and support change initiatives
- Results-driven mindset andcapacityto deliver
- Strong collaboration and teamwork skills
- Ability to synthesize and simplify complex technical topics
- Proficiencyin analytical thinking and resilience in handling challenges
- Desirable: Familiarity with tools such as DWH, Dataiku, Spark, Airflow, S3, Kubernetes, and CI/CD platforms
Language Skills
- English: B2 level or higher
- French: B1 level (optional)
Benefits
- Training programs, career paths, and opportunities for internal mobility-nationally and internationally-thanks to our global presence
- Diversity and Inclusion Committee fostering an inclusive work environment, with employee communities organizing awareness actions (PRIDE, We Generations,MixCity, etc.)
- Corporate volunteering program (1MillionHours 2 Help) supporting employees in their commitment to volunteering activities
- Flexible compensation plan
- Hybrid telecommuting model (50 )
- 31 vacation days
Spark, Scala, Airflow, Kafka
About the job
Junior Data Engineer South Europe Technologies (S.ET), BNP Paribas Personal Finance
South Europe Technologies (S.ET) is one of BNP Paribas Personal Finance shared services centers delivering the best IT Solutions to BNP Paribas Personal Finance entities around the world:
- Applications Management (Architecture, Project management, Development, and Quality Assurance)
- IT Risks Cybersecurity services
- Platforms management
- Data
- Ad-hoc, T M development
In this context, we are looking for a Data Engineer.
As a Data Engineer, your mission is to design, implement, and optimize robust data pipelines and infrastructure, enabling reliable, secure, and high-performance data flows throughout the organization. You will work closely with stakeholders and multidisciplinary teams to support data integration, transformation, and delivery processes, contributing to the ongoing evolution and stability of our data platforms.
Your main activities are to:
- Implement and maintain orchestrators and scheduling systems to automate data pipeline execution (e.g., Airflow as a service).
- Modify and enhance existing codebases in line with business requirements, continuously driving improvements in performance and maintainability.
- Monitor, ensure, andoptimizethe performance and security of the data infrastructure, applying best practices in Data Engineering.
- Contribute to production support, incident resolution, and anomaly correction, as well as support functional and technical evolutions to ensure process stability.
- Develop andmaintaincomprehensive technical documentation to ensure effective knowledge capitalization.
- Assistin building andmaintainingdata pipelines using Spark on Scala for collecting and processing data from diverse sources such as Kafka topics, APIs, HDFS, and structured databases.
- Support data transformation activities and contribute to data quality assurance, ensuring the reliability and accuracy of information.
- Help set up CI/CD pipelines under the guidance of senior team members to automate testing and deployment.
- Learn and employ orchestration tools like Airflow for scheduling and automating data workflows.
- Make incremental improvements to code and contribute to performance enhancements asrequired, aligned with business needs.
- Participate in monitoring data infrastructure for performance and security, learning and applying industry best practices.
- Assistwith production support tasks, including incident identification and resolution, and support ongoing technical improvements.
- Document and update technical processes to ensure clear records of changes and procedures.
IT Tools
- Good knowledge of
- Spark on Scala
- CI/CD tools (Gitlab, Jenkins...)
- HDFS and structured databases (SQL)
- Full understanding of
- Apache Airflow
- Streaming process (Kafka, event steam...)
- S3 storage
- Shell script
- Some knowledge of
- Kubernetes
- Optionally/ as a plus
- Elasticsearch and Kibana
- HVault
- Dremioas tool to virtualize data
- Dataiku
What we are looking for
- Demonstrated knowledge of the banking sector and related business processes
- Experience in managing business and IT relationships
- Ability to understand, explain, and support change initiatives
- Results-driven mindset andcapacityto deliver
- Strong collaboration and teamwork skills
- Ability to synthesize and simplify complex technical topics
- Proficiencyin analytical thinking and resilience in handling challenges
- Desirable: Familiarity with tools such as DWH, Dataiku, Spark, Airflow, S3, Kubernetes, and CI/CD platforms
Language Skills
- English: B2 level or higher
- French: B1 level (optional)
Benefits
- Training programs, career paths, and opportunities for internal mobility-nationally and internationally-thanks to our global presence
- Diversity and Inclusion Committee fostering an inclusive work environment, with employee communities organizing awareness actions (PRIDE, We Generations,MixCity, etc.)
- Corporate volunteering program (1MillionHours 2 Help) supporting employees in their commitment to volunteering activities
- Flexible compensation plan
- Hybrid telecommuting model (50 )
- 31 vacation days
Spark, Scala, Airflow, Kafka