Citi company logo

Python Full Stack Data Engineer - Assistant Vice President at Citi

CitiVerified

Get jobs like this by email

First name, email, subscribe.

Job Details

Status
Active
Posted
May 18, 2026
Expires
Aug 16, 2026
Work style
Hybrid

Share with someone qualified

About the Role

We are assembling an A-team of highly skilled, autonomous, and AI-first engineers, and we are seeking an exceptional Full Stack Data Engineer to join our high-performing, co-located squads in Canada. This role is for a hands-on engineer who is passionate about leveraging data, proficient in building end-to-end data solutions, and deeply committed to using AI tools to maximize productivity. The ideal candidate will be instrumental in designing, developing, and optimizing robust data pipelines, from ingestion to consumption, using Python, PySpark, and other big data technologies. We are looking for an AI-first thinker who can profoundly understand the functional domains our work impacts, and significantly contribute to our data strategy and culture.

Responsibilities:

  • Operate end-to-end in the design, development, and implementation of full-stack data solutions, ensuring optimal performance, scalability, data quality, security, and compliance across the data lifecycle.
  • Collaborate closely within small, co-located squads (4-7 person teams), fostering an environment of high communication and minimal coordination overhead, to deliver impactful data products.
  • Develop, maintain, and optimize highly efficient and resilient data ingestion, processing, and transformation pipelines using advanced Python and PySpark techniques for large-scale datasets.
  • Implement sophisticated data storage solutions leveraging a diverse set of big data technologies including Hive, distributed file systems (e.g., HDFS, S3), and enterprise-grade NoSQL databases (e.g., Cassandra, MongoDB).
  • Design and implement scalable data models and schemas that support advanced analytics, machine learning, and critical reporting needs, ensuring data integrity, accessibility, and discoverability.
  • Engage effectively with data consumers, data scientists, and business stakeholders to deeply understand their requirements, translating them into robust data solutions and providing expert guidance on data utilization and interpretation.
  • Implement real-time data streaming and complex event-driven architectures using technologies like Apache Kafka, ensuring low-latency data availability for critical business functions.
  • Adhere to and contribute to best practices in data engineering and software development, participating in rigorous code reviews, implementing comprehensive automated testing strategies, and supporting robust CI/CD pipelines within a DevOps culture.
  • Exhibit High Autonomy and Agency, taking ownership of technical challenges, making well-reasoned architectural decisions, and proactively identifying and implementing continuous improvements across the data landscape.
  • Innovate with AI-Powered Development, actively leveraging, integrating, and contributing to AI coding tools (e.g., internal Citi AI tools, Copilot, Claude Code, Codex, Antigravity) to significantly enhance productivity, code quality, and development velocity, and inspiring others to do the same.
  • Participate in technical discussions and contribute to the evolution of our big data technology stack, evaluating new technologies, and making strategic recommendations that align with business objectives and architectural vision.
  • Expertly Troubleshoot and Resolve challenging technical issues within complex, distributed big data environments, applying advanced analytical and problem-solving methodologies.

Required Skills & Experience:

  • Experience: 4+ years of progressive, hands-on experience as a Data Engineer, with a proven track record of delivering complex, large-scale data solutions.
  • Programming Languages:
    • Expert-level proficiency in Python, with deep expertise in developing highly optimized, scalable, and production-grade PySpark applications for mission-critical data processing.
  • Big Data Frameworks/Technologies:
    • Deep understanding and extensive hands-on experience with the entire Apache Spark ecosystem (Spark Core, Spark SQL, Spark Streaming).
    • Advanced proficiency with Hive for enterprise data warehousing, including optimization techniques for large and complex queries.
    • Expert knowledge of distributed computing fundamentals, HDFS, and other components of the Hadoop ecosystem.
  • Data Storage & Management:
    • Proficiency in SQL, complex query optimization, and advanced data warehousing concepts (e.g., dimensional modeling, data vault, data lakes).
    • Extensive experience with various data storage formats (e.g., Parquet, ORC, Avro) and leading data lake solutions (e.g., Delta Lake, Iceberg).
    • Proven experience with enterprise-grade NoSQL databases (e.g., Cassandra, MongoDB, HBase) and understanding of their architectural trade-offs.
  • Messaging & Event Streaming:
    • Expert-level experience with Apache Kafka, including design and implementation of high-throughput, low-latency real-time data pipelines and event-driven architectures.
  • Cloud Platforms:
    • Extensive experience with big data services on major cloud platforms (e.g., AWS EMR/Glue/Redshift/Kinesis, Azure Databricks/Data Factory/Synapse/Event Hubs, GCP Dataflow/Dataproc/BigQuery/Pub/Sub), including cloud-native architectural patterns.
  • AI-Powered Development & Productivity:
    • Mandatory: Demonstrated mastery and innovative application of AI coding tools (e.g., Claude Code, Codex, Antigravity) to significantly enhance the development lifecycle.
    • A proactive, "AI-first thinker" mindset, with a proven ability to evaluate, integrate, and evangelize new AI tools and methodologies within the team to drive continuous improvement and innovation.
  • Domain Understanding:
    • Expert ability to articulate the intricacies of the functional domain, proactively identifying business challenges and opportunities, and translating them into impactful, data-driven solutions.
  • Other Essential Skills:
    • Advanced understanding of software engineering principles, design patterns, data structures, algorithms, and performance engineering for distributed systems.
    • Extensive experience with RESTful API design, development, and integration for data services.
    • Strong expertise in containerization technologies (e.g., Docker, Kubernetes) and orchestration for deploying and managing scalable data applications.
    • Master-level proficiency with version control systems, especially Git, including advanced branching, merging, and code review strategies.
    • Exceptional problem-solving, analytical, and debugging skills applied to highly complex, distributed big data ecosystems.
    • Superior communication, presentation, and interpersonal skills, with the ability to articulate complex technical concepts to diverse audiences and influence strategic decisions.
  • Demonstrated high autonomy and agency in driving strategic initiatives and delivering impactful, innovative data solutions.
  • Education:

    • Bachelor's or Master's degree in Computer Science, Engineering, Data Science, or a related quantitative field is required. Equivalent advanced practical experience with a demonstrable track record of architecting and delivering major data initiatives will also be considered.

    ------------------------------------------------------

    Job Family Group:

    Technology

    ------------------------------------------------------

    Job Family:

    Applications Development

    ------------------------------------------------------

    Time Type:

    Full time

    ------------------------------------------------------

    Primary Location Full Time Salary Range:

    $94,300.00 - $141,500.00

    ------------------------------------------------------

    Most Relevant Skills

    Please see the requirements listed above.

    ------------------------------------------------------

    Other Relevant Skills

    For complementary skills, please see above and/or contact the recruiter.

    ------------------------------------------------------

    Automated Processing and AI

    We use automated processing, including artificial intelligence, for our legitimate business interests (or our reasonable and appropriate business purposes) to identify and align the candidate's skills and abilities with a specific job opening. Additionally, if you so choose, or consent, we can match your skills and abilities to other suitable roles at Citi.

    Importantly, all our hiring processes and decisions, including determining your suitability for a role, are conducted, checked, and decided by individuals. Our automated processing and AI do not involve relying on automatic or autonomous decision-making. Please refer to any Jurisdictional Considerations, with specific provisions for your country (where relevant) for further details.

    ------------------------------------------------------

    This job opening is for an existing job vacancy.

    ------------------------------------------------------

    Citi is an equal opportunity employer, and qualified candidates will receive consideration without regard to their race, color, religion, sex, sexual orientation, gender identity, national origin, disability, status as a protected veteran, or any other characteristic protected by law.

    If you are a person with a disability and need a reasonable accommodation to use our search tools and/or apply for a career opportunity review Accessibility at Citi.

    View Citi’s EEO Policy Statement and the Know Your Rights poster.

    CV Match Tool

    Check if your CV matches this job before applying

    This job accepts direct applications - no recruiter in between. Posted May 18, 2026.

    Apply on Company Site

    More Jobs in Mississauga Ontario Canada

    Remote Jobs in Mississauga Ontario Canada

    No same-location remote jobs were found, so here are remote Data Science & Analytics jobs from other countries.

    Articles You May Like

    • Best Cybersecurity Certifications in 2026 You Should Have to Land a Job

      AI and Automation Jun 9, 2026

      Cybersecurity certifications are more popular than ever, but many professionals are chasing the wrong credentials for their career goals. In 2026, the smartest move isn't collecting certificates; it's choosing the one that aligns with the job you actually want. From Security+ and CISSP to CCSP, CISM, OSCP, and GIAC, here's what matters most before you invest your time and money.

    • How to Become an AI Engineer in 2026

      Career Advice Jun 7, 2026

      AI engineering in 2026 is no longer just about learning Python or training machine learning models. Companies want people who can build real AI systems, integrate them into products, evaluate their performance, and ensure reliability. Here’s why most beginners are preparing the wrong way, and what to focus on instead.

    • ChatGPT Skills for Jobs in 2026

      AI and Automation Jun 6, 2026

      As ChatGPT becomes a must-have workplace tool in 2026, many job seekers are focusing on the wrong skills. In this article, I explain why employers care less about memorized prompts and more about AI workflow thinking, the ability to use ChatGPT to research, analyze, verify, organize, and produce real business outcomes.

    • Why AI Skills Are Becoming the New Career Filter

      AI and Automation Jun 4, 2026

      AI is no longer just a bonus skill. In 2026, employers are looking for workers who can use AI to improve real work, not just generate quick answers. This article explains why prompt writing is only the beginning — and why skills like workflow design, AI evaluation, data judgment, risk awareness, and domain expertise are becoming essential for career growth.

    • Countries Best for Remote Workers in 2026

      Career Advice May 7, 2026

      With 56 countries now competing for remote workers, the decision isn't about finding the "best" destination, it's about understanding where your income level, tax situation, and work style actually align.

    Related Jobs

    More jobs in Data Science & Analytics that are worth reviewing next.

    Recently Posted Jobs

    Fresh openings users can continue browsing from here.