NOTE: No c2c or corp-corp, no 1099) ONLY accepting candidates that can work on iSpace W2!!!!
Job Description:
Daily Tasks Performed
Develop and Maintain Data Integration Solutions:
Design and implement data integration workflows using AWS Glue/EMR, Lambda, Redshift
Demonstrate proficiency in PySpark, Apache Spark and Python for data processing large datasets
Ensure data is accurately and efficiently extracted, transformed, and loaded into target systems.
Ensure Data Quality and Integrity:
Validate and cleanse data to maintain high data quality.
Ensure data quality and integrity by implementing monitoring, validation, and error handling mechanisms within data pipelines
Optimize Data Integration Processes:
Enhance performance, optimization of data workflows to meet SLAs, scalability of data integration processes and cost-efficiency on AWS cloud infrastructure.
Identify and resolve performance bottlenecks, fine-tuning queries, and optimizing data processing to enhance Redshift's performance
Regularly review and refine integration processes to improve efficiency.
Support Business Intelligence and Analytics:
Translate business requirements to technical specifications and coded data pipelines
Ensure timely availability of integrated data for business intelligence and analytics.
Collaborate with data analysts and business stakeholders to meet their data requirements.
Maintain Documentation and Compliance:
Document all data integration processes, workflows, and technical & system specifications.
Ensure compliance with data governance policies, industry standards, and regulatory requirements.