




Job Summary: Designs, develops, and maintains scalable data pipelines, optimizes ETL/ELT processes, and ensures data quality and availability while collaborating with teams and clients.

Key Highlights:

1. Design and maintain scalable and efficient data pipelines.
2. Collaborate with teams and clients to deliver technical and value-driven solutions.
3. Leverage generative AI and intelligent agents to accelerate development.

**Company Description**

Inetum is a global technology services and digital innovation company present in more than 26 countries. In Colombia, Inetum delivers advanced technological solutions that drive digital transformation for enterprises across diverse sectors, offering services in consulting, software development, systems integration, outsourcing, and technical support. With a focus on agility and adaptability, Inetum Colombia aims to create value for its clients by implementing innovative technologies that enhance efficiency and productivity. Our team comprises experts across various technological domains, working in a collaborative, dynamic, and results-oriented environment. Join Inetum and become part of a company transforming the future of businesses through technology.

**Job Description**

**Responsibilities**

* Design, develop, and maintain **scalable and efficient data pipelines**.
* Build and optimize **ETL/ELT processes** for data ingestion, transformation, and loading from multiple sources.
* Ensure **data quality, consistency, traceability, and availability**.
* Process large volumes of structured and unstructured data.
* Collaborate with analytics, data science, and business teams to understand requirements and translate them into technical solutions.
* Participate in defining cloud-based data architectures.
* Leverage **generative AI and intelligent agents** to accelerate development, analysis, and problem resolution.
* Document developments, workflows, and best practices.
* Interact directly with clients, understanding their needs and proposing value-driven solutions.

**Requirements**

* Solid experience with **SQL databases**.
* Knowledge and experience with **relational and non-relational databases** (PostgreSQL, MySQL, SQL Server, MongoDB, DynamoDB, or others).
* Proficiency in **Python** and **Scala** for data manipulation and processing.
* Practical experience with **Apache Spark** (Spark SQL, DataFrames, job optimization).
* Knowledge of **data transformation, cleaning, validation, and enrichment**.
* Familiarity with **ETL/ELT tools and frameworks** (Airflow, AWS Glue, DBT, or others).
* Understanding of storage architectures such as **Data Lake / Lakehouse**.
* Experience with **AWS cloud services** (S3, EMR, Glue, Lambda, Redshift, Athena, etc.).
* Knowledge of Control-M and Data X.
* Proficiency in **version control (Git / GitHub)** and sound development practices.
* **Knowledge of AI agents and use of generative AI tools** for technical support and productivity, such as:
  + **GitHub Copilot** for coding assistance and code refactoring.
  + **GPT (OpenAI)** for code generation, data analysis, documentation, and troubleshooting.
  + **Gemini (Google)** or other large language model platforms for technical exploration and support.
  + Basic understanding of how to integrate or consume **LLMs via APIs** for analytical or automation use cases.
* Java experience or knowledge is a plus.

**Additional Information**

* Strong **analytical capability**, with emphasis on data accuracy and quality.
* **Clear and effective communication** with both technical and non-technical stakeholders.
* Experience working **directly with clients**, gathering requirements and managing expectations.
* Proactivity and commitment to continuous improvement.
* Ability to work collaboratively in multidisciplinary teams.
* Autonomy, organization, and accountability in delivering results.
* Adaptability to dynamic environments and emerging technologies.


