Pyspark Data Engineer

Details of the offer

The Applications Development Senior Supervisor is an intermediate management level position responsible for providing full leadership and direction to a team of employees in an effort to establish and implement new or revised data platform eco systems and programs in coordination with the Technology team. The overall objective of this role is to lead data engineering systems analysis and programming activities.

Responsibilities:
Build and maintain batch or real-time data pipelines in data platform.
Maintain and optimize the data infrastructure required for accurate extraction, transformation, and loading of data from a wide variety of data sources.
Develop ETL (extract, transform, load) processes to help extract and manipulate data from multiple sources.
Monitor and control all phases of development process and analysis, design, construction, testing, and implementation as well as provide user and operational support on applications to business users
Automate data workflows such as data ingestion, aggregation, and ETL processing.
Prepare raw data in Data Warehouses into a consumable dataset for both technical and non-technical stakeholders.
Monitor data systems performance and implement optimization solution.
Has the ability to operate with a limited level of direct supervision.
Can exercise independence of judgement and autonomy.
Acts as SME to senior stakeholders and /or other team members.
Serve as advisor or coach to new or lower level analysts
Appropriately assess risk when business decisions are made, demonstrating particular consideration for the firm's reputation and safeguarding Citigroup, its clients and assets, by driving compliance with applicable laws, rules and regulations, adhering to Policy, applying sound ethical judgment regarding personal behaviour, conduct and business practices, and escalating, managing and reporting control issues with transparency.

Qualifications:
5+ years of relevant experience in Data engineering role
Advanced SQL skills and experience with relational databases and database design.
Strong experience in object-oriented languages: Python, PySpark is must
Experience working with data ingestion tools such as Talend & Ab Initio.
Experience working with data lakehouse architecture such as Iceberg/Starburst
Strong experience in scripting languages like Bash.
Strong experience in data pipeline and workflow management tools
Strong experience in scripting languages like Bash.
Excellent problem-solving, communication, and organizational skills.
Proven ability to work independently and with a team.
Experience in managing and implementing successful projects
Ability to adjust priorities quickly as circumstances dictate
Consistently demonstrates clear and concise written and verbal communication

Education:
Bachelor's degree/University degree or equivalent experience

This job description provides a high-level review of the types of work performed. Other job-related duties may be assigned as required.
The Applications Development member (data engineering senior programmer) is an intermediate level position responsible for participation in the establishment and implementation of new or revised data platform eco systems and programs in coordination with the Technology team. The overall objective of this role is to contribute to data engineering scrum team to implement the business requirements:

Responsibilities:
Build and maintain batch or real-time data pipelines in data platform.
Maintain and optimize the data infrastructure required for accurate extraction, transformation, and loading of data from a wide variety of data sources.
Develop ETL (extract, transform, load) processes to help extract and manipulate data from multiple sources.
Monitor and control all phases of development process and analysis, design, construction, testing, and implementation as well as provide user and operational support on applications to business users
Automate data workflows such as data ingestion, aggregation, and ETL processing.
Prepare raw data in Data Warehouses into a consumable dataset for both technical and non-technical stakeholders.
Build, maintain, and deploy data products for analytics and data science teams on data platform
Ensure data accuracy, integrity, privacy, security, and compliance through quality control procedures.
Monitor data systems performance and implement optimization solution.
Has the ability to operate with a limited level of direct supervision.
Can exercise independence of judgement and autonomy.
Acts as SME to senior stakeholders and /or other team members.
Serve as advisor or coach to new or lower level analysts
Appropriately assess risk when business decisions are made, demonstrating particular consideration for the firm's reputation and safeguarding Citigroup, its clients and assets, by driving compliance with applicable laws, rules and regulations, adhering to Policy, applying sound ethical judgment regarding personal behavior, conduct and business practices, and escalating, managing and reporting control issues with transparency.

Qualifications:
5+ years of relevant experience in Data engineering role
Advanced SQL/ RDBMS skills and experience with relational databases and database design.
Strong proficiency in object-oriented languages: Python, PySpark is must
Experience working with Bigdata - Hive/Impala/S3/HDFS
Experience working with data ingestion tools such as Talend or Ab Initio.
Nice to working with data lakehouse architecture such as AWS Cloud/Airflow/Starburst/Iceberg
Strong proficiency in scripting languages like Bash, UNIX Shell scripting
Strong proficiency in data pipeline and workflow management tools
Strong project management and organizational skills.
Excellent problem-solving, communication, and organizational skills.
Proven ability to work independently and with a team.
Experience in managing and implementing successful projects
Ability to adjust priorities quickly as circumstances dictate
Consistently demonstrates clear and concise written and verbal communication

Education:
Bachelor's degree/University degree or equivalent experience

------------------------------------------------------
Job Family Group:
Technology------------------------------------------------------
Job Family:
Applications Development------------------------------------------------------
Time Type:
Full time------------------------------------------------------
Citi is an equal opportunity and affirmative action employer.

Qualified applicants will receive consideration without regard to their race, color, religion, sex, sexual orientation, gender identity, national origin, disability, or status as a protected veteran.

Citigroup Inc. and its subsidiaries ("Citi") invite all qualified interested applicants to apply for career opportunities. If you are a person with a disability and need a reasonable accommodation to use our search tools and/or apply for a career opportunity reviewAccessibility at Citi.

View the "EEO is the Law" poster. View theEEO is the Law Supplement.
View theEEO Policy Statement.
View thePay Transparency Posting


Source: Eightfold_Ai

Requirements

Infrastructure Specialist: System Administration

As an Infrastructure Specialist at IBM, you will support the infrastructure running industries likes transportation, energy, insurance, banking, or healthcar...


From Ibm Careers - Maharashtra

Published a month ago

Security Consultant-Network Security

As a Network Security Engineer, you are expected to work on Networking products or solutions based on any vendor hardware /vendor operating system software (...


From Ibm Careers - Maharashtra

Published a month ago

Package Consultant: Sap Hana Scm Pm

As a Consultant you will serve as a client-facing practitioner who sells, leads and implements expert services utilizing the breadth of IBM's offerings and t...


From Ibm Careers - Maharashtra

Published a month ago

Application Architect: Mobile

Software Development Life Cycle (SDLC) framework, IT Service Management procedures, development solutions which run on multiple platforms. may be composed of...


From Ibm Careers - Maharashtra

Published a month ago

Built at: 2024-11-01T02:22:43.767Z