Job Title:Data Engineer
We are looking for a hands-on Data Engineer who is passionate about solving business problems through innovation and engineering practices. As a Data Engineer, the candidate will leverage deep technical knowledge andwill apply knowledge of data architecture standards, data warehousing, data structures, and business intelligenceto drive the creation of high-quality data products fordata driven decision making.
Required Qualifications
6+ Years of relevant experience of implementing data-intensive solutions using agile methodologies.
Code contributing member of Agile teams, working to deliver sprint goals.
Write clean, efficient, and maintainable code that meets the highest standards of quality.
Very strong in coding Python/Pyspark, UNIX shell scripting
Experience in cloud native technologies and patterns
Ability to automate and streamline the build, test and deployment of data pipelines
Technical Skills (Must Have)
ETL:Hands on experience of building data pipelines. Proficiency in data integration platforms such as Apache Spark
Experienced in writing Pyspark code to handle large data set ,perform data transformation , familiarity with Pyspark integration with other Apache Spark component ,such as Spark SQL , Understanding of Pyspark optimization techniques
Strong proficiency in working with relational databases and using SQL for data querying, transformation, and manipulation.
Big Data:Exposure to'big data' platforms such as Hadoop, Hive or Iceberg for data storage and processing
Data Warehousing & Database Management: Understanding of Data Warehousing concepts, Relational (Oracle, MSSQL, MySQL) and NoSQL (MongoDB, DynamoDB) database design
Data Modeling & Design:Good exposure to data modeling techniques; design, optimization and maintenance of data models and data structures
Languages: Proficient in one or more programming languages commonly used in data engineering such as Python, PySpark, UNIX Shell scripting
DevOps: Exposure to concepts and enablers - CI/CD platforms, bitbucket/Github, JIRA, Jenkins, Tekton, Harness
Technical Skills (Valuable)
Data Quality & Controls: Exposure to data validation, cleansing, enrichment and data controls, framework libraries like Deequ
Federated Query: Starburst, Trino
Containerization: Fair understanding of containerization platforms like Docker, Kubernetes, Openshift
File Formats: Exposure in working on File/Table Formats such as Avro, Parquet, Iceberg, Delta
Schedulers: Basics of Job scheduler like Autosys, Airflow
Cloud: Experience in cloud native technologies and patterns (AWS, Google Cloud)
Nice to have: Java, for REST API development
Other skills :
Strong project management and organizational skills.
Excellent problem-solving, communication, and organizational skills.
Proven ability to work independently and with a team.
Experience in managing and implementing successful projects
Ability to adjust priorities quickly as circumstances dictate
Consistently demonstrates clear and concise written and verbal communication
Education:
Bachelor's degree/University degree or equivalent experience
------------------------------------------------------
Job Family Group:
Technology------------------------------------------------------
Job Family:
Applications Development------------------------------------------------------
Time Type:
Full time------------------------------------------------------
Citi is an equal opportunity and affirmative action employer.
Qualified applicants will receive consideration without regard to their race, color, religion, sex, sexual orientation, gender identity, national origin, disability, or status as a protected veteran.
Citigroup Inc. and its subsidiaries ("Citi") invite all qualified interested applicants to apply for career opportunities. If you are a person with a disability and need a reasonable accommodation to use our search tools and/or apply for a career opportunity reviewAccessibility at Citi.
View the "EEO is the Law" poster. View theEEO is the Law Supplement.
View theEEO Policy Statement.
View thePay Transparency Posting