Data Engineer

Mid / Senior | Remote

About This Workplace

Join our diverse and inclusive team, where you will feel valued and motivated to contribute your unique skills and experience. Exavalu offers a permanent remote working model, as we believe in going where the right talent is.


Key Responsibilities:

  • Develop, implement, support, and operationalize AWS data lake infrastructure & services.
  • Create and maintain optimal data pipeline architecture.
  • Build the infrastructure required for optimal extraction, transformation, and loading of data from a wide variety of data sources using SQL, ETL tools (e.g., Informatica Cloud), and AWS data lake technologies.
  • Build analytics tools that utilize the data pipeline to provide actionable insights into patient care, operational efficiency, and other key business performance metrics (see the sketch after this list).
  • Develop a deep understanding of the vast data sources in AWS and know exactly how, when, and which data to use to solve business problems.
  • Monitor and maintain data lake security and data lake services.
  • Manage numerous requests concurrently and strategically, prioritizing when necessary.
  • Troubleshoot technical issues and provide solutions and fixes using tools and information such as server logs and report debug logs.
  • Perform general and administrative tasks as needed.
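
For illustration, a minimal sketch of the kind of analytics tooling described above: submitting a SQL query against the data lake through Amazon Athena with boto3 and polling for completion. The database name, output bucket, table, and query here are hypothetical placeholders, not part of the role description.

    import time

    import boto3

    # Hypothetical placeholders; real names come from the data lake setup.
    ATHENA_DATABASE = "datalake_curated"
    RESULTS_S3 = "s3://example-bucket/athena-results/"

    def run_query(sql: str) -> str:
        """Submit a query to Athena and return its execution id."""
        athena = boto3.client("athena")
        resp = athena.start_query_execution(
            QueryString=sql,
            QueryExecutionContext={"Database": ATHENA_DATABASE},
            ResultConfiguration={"OutputLocation": RESULTS_S3},
        )
        return resp["QueryExecutionId"]

    def wait_for_query(execution_id: str) -> None:
        """Poll until the query finishes; raise if it fails or is cancelled."""
        athena = boto3.client("athena")
        while True:
            status = athena.get_query_execution(QueryExecutionId=execution_id)
            state = status["QueryExecution"]["Status"]["State"]
            if state == "SUCCEEDED":
                return
            if state in ("FAILED", "CANCELLED"):
                raise RuntimeError(f"Athena query ended in state {state}")
            time.sleep(2)

    # Example: daily claim counts feeding an operational dashboard.
    execution_id = run_query(
        "SELECT claim_date, COUNT(*) AS claims FROM claims GROUP BY claim_date"
    )
    wait_for_query(execution_id)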

Desired Profile:

  • Bachelor's degree in Computer Science, Information Systems, Mathematics, or a related discipline.
  • 4+ years of experience in Information Technology within a complex, matrixed, and global business environment.
  • Experience as a data engineer with AWS data lake technologies and services.
  • Expert SQL knowledge and experience with relational databases, including query authoring, as well as working familiarity with a variety of databases.
  • Experience building and optimizing AWS data lake pipelines, architectures, and data sets.
  • Strong analytic skills related to working with unstructured datasets.
  • Experience building processes that support data transformation, data structures, metadata, dependency, and workload management.
  • A successful history of manipulating, processing, and extracting value from large, disconnected datasets.
  • Understanding of message queuing, stream processing, and highly scalable AWS data lake stores.
  • Understanding of industry database and analytical technologies, including MPP and NoSQL databases (e.g., Snowflake), data warehouse design, ETL, BI reporting, and dashboard development.
  • Experience with Agile framework and DevOps.
  • 3+ years of experience building ETL data pipelines using AWS Glue and PySpark.
  • Proficient in developing Spark scripts for data ingestion, aggregation, and transformation.
  • Experience with exception handling and performance optimization techniques in Python/PySpark scripts (a sketch follows this list).
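
For illustration, a minimal PySpark sketch of the ingestion, aggregation, and transformation pattern listed above, with basic exception handling. Paths and column names are hypothetical; a production AWS Glue job would typically wrap the same logic in a GlueContext.

    import sys

    from pyspark.sql import SparkSession, functions as F

    def main() -> None:
        spark = SparkSession.builder.appName("claims-daily-aggregation").getOrCreate()
        try:
            # Ingest raw CSVs from a hypothetical landing-zone path.
            raw = spark.read.option("header", "true").csv(
                "s3://example-bucket/landing/claims/")

            # Transform: parse dates and cast amounts before aggregating.
            cleaned = (
                raw.withColumn("claim_date", F.to_date("claim_date", "yyyy-MM-dd"))
                   .withColumn("amount", F.col("amount").cast("double"))
            )

            # Aggregate: daily totals and counts per facility.
            daily = cleaned.groupBy("facility_id", "claim_date").agg(
                F.sum("amount").alias("total_amount"),
                F.count("*").alias("claim_count"),
            )

            # Write curated output as Parquet partitioned by date.
            daily.write.mode("overwrite").partitionBy("claim_date").parquet(
                "s3://example-bucket/curated/claims_daily/")
        except Exception as exc:
            # Fail loudly so the orchestrator marks the job run as failed.
            print(f"Job failed: {exc}", file=sys.stderr)
            raise
        finally:
            spark.stop()

    if __name__ == "__main__":
        main()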
