Data Engineer III

Walmart

3 months ago

5 - 7 years

Hybrid

Bengaluru, Karnataka, India

  • Google Cloud Platform (GCP)

    BigQuery

    Kafka

    Data Modeling

    ETL Processes

    SQL/NoSQL

    Version Control (Git)

    CI/CD Pipelines

    Business Intelligence

    Data Visualization (Tableau, Power BI)

    Artificial Intelligence (AI)

    Job description & requirements

    What You’ll Do

    1. As a Data Engineer, you will play a critical role in designing, developing, and implementing data pipelines and data integration solutions using Spark, Scala, Python, Airflow and Google Cloud Platform (GCP).
    2. You will be responsible for building scalable and efficient data processing systems, optimizing data workflows, and ensuring data quality and integrity.
    3. Monitor and troubleshoot data pipelines to ensure data availability and reliability.
    4. Tune and optimize the performance of data processing systems for improved efficiency and scalability.
    5. Work with stakeholders including the Executive, Product, Data and Design teams to assist with data-related technical issues and support their data infrastructure needs.
    6. Work closely with data scientists and analysts to provide them with the data sets and tools they need for analysis and reporting.
    7. Create data tools for analytics team members that assist them in building and optimizing our product into an innovative industry leader.
    8. Stay up to date with the latest industry trends and technologies in data engineering and apply them to enhance the data infrastructure.


    What You’ll Bring

    1. Proven working experience as a Data Engineer with a minimum of 5 years in the field.
    2. Strong programming skills in Scala and experience with Spark for data processing and analytics.
    3. Familiarity with Google Cloud Platform (GCP) services such as BigQuery, GCS, and Dataproc.
    4. Experience developing near-real-time ingestion pipelines using Kafka and Spark Structured Streaming.
    5. Experience with data modelling, data warehousing, and ETL processes.
    6. Understanding of data warehousing concepts and best practices.
    7. Strong knowledge of SQL and NoSQL systems
    8. Proficiency in version control systems, particularly Git.
    9. Proficiency in working with large-scale data sets and distributed computing frameworks.
    10. Familiarity with CI/CD pipelines and tools such as Jenkins or GitLab CI.
    11. Familiarity with schedulers like Airflow.
    12. Strong problem-solving and analytical skills.
    13. Familiarity with BI and visualisation tools such as Tableau or Looker.
    14. A background in Generative Artificial Intelligence (Gen AI) is desirable but not essential.


    Experience:

    5 - 7 years

    Job Domain/Function:

    Data Engineering

    Job Type:

    Hybrid

    Employment Type:

    Full Time

    Number of Position(s):

    1

    Educational Qualifications:

    Bachelor's Degree

    Location:

    Bengaluru, Karnataka, India
