Job Description
Overview
Looking for an astute, proficient and qualified Senior Data Engineer to assess, analyze and work with data concepts, use-cases & complex new data sources to provide business insights to customers and support the implementation & integration of the data sources into the Presight platform.
The opportunity
To play a critical role in the development of high-performance data solutions & information products at Presight.
Presight is UAE’s leading Analytics-driven, Cloud first, AI-enabled company with a deep focus on driving digital transformation in the MENACA region to power the next generation of cities, businesses, and industries. When it comes to challenges faced by organizations in the MENACA regions, Presight has in-depth knowledge and expertise. Presight enables public and private sector stakeholders to make analytics-driven, intelligent decisions. Its solutions help steer digital transformation and simplify its customers’ challenges across various industries, including digital governments, national security, national cloud, healthcare, financial sectors, and infrastructure.
Founded in 2020, the company is headquartered in Abu Dhabi, UAE. Presight has developed transformative products/solutions which enable its customers to gain a stark competitive advantage by integrating AI, deep Analytics and ML in their digital transformation. The team of over +190 staff includes domain specialists, operations experts, data scientists, solution architects, software developers, engineers and data analysts from different countries – all committed to delivering impact.
Presight is an operating company of G42, UAE’s leading AI and Cloud Computing company that champions AI as drivers that power progress, propelled by the combination of exceptional people and technology.
Responsibilities
Key responsibilities
The candidate is expected to have a solid background on software development with strong python coding skills and solve challenging problems.
Developing Data pipelines with Cloud Services & On-premise Data Centers.
Web crawling, data cleaning, data annotation, data ingestion and data processing.
Reading and collating complex data sets
Creating and maintaining data pipelines
Continual focus on process improvement to drive efficiency and productivity within the team
Use of Python, SQL, ES, Shell etc. to build the infrastructure required for optimal extraction, transformation, and loading of data
Provide insights into key business performance metrics by building analytical tools that utilize the data pipeline
Support the wider business with their data needs on an ad hoc basis
Open to extensive international business travel as and when required, and for extended periods
Skills And Attributes For Success
The yardsticks of your performance would include cost savings achieved, number of data systems adhering to architecture standards, adherence to policies, procedures & quality standards, number of innovations/POCs deployed, database throughput, and customer NPS.
Qualifications
To qualify, you must have
4+ years of programming experience, solid coding skills in Python, Shell, and Java
Bachelor’s degree in computer engineering, Computer Science, or Electrical Engineering and Computer Sciences.
Strong practical knowledge in data processing and migration tools, such as Apache NiFi, Kafka, and Spark.
Design, build, and maintain data processing with CDP(Cloudera Data Platform) Private Cloud.
Develop and Maintain Data Workflow with Apache Airflow.
Experience with HDFS or Similar Object Storage
Strong Understanding about Distribute Computing and Distributed Systems
Experience with Web crawling, cleaning.
Experience with solution architecture, data ingestion, query optimization, data segregation, ETL, ELT, AWS, EC2, S3, SQS, lambda, Elastic Search, Redshift, CI/CD frameworks and workflows.
Working knowledge of data platform concepts – data lake, data warehouse, ETL, big data processing (designing and supporting variety/velocity/volume), real time processing architecture for data platforms, scheduling and monitoring of ETL/ELT jobs
PostgreSQL and programming (preferably Java, Python), proficiency in understanding data, entity relationships, structured & unstructured data, SQL and NoSQL databases
Knowledge of best practice in optimizing columnar and distributed data processing system and infrastructure
Experienced in designing and implementing dimensional modelling
Knowledge of machine learning and data mining techniques in one or more areas of statistical modelling, text mining and information retrieval.
Be open to extensive international business travel as and when required, and for extended periods
Ideally, you’ll also need
Strong analytical and data visualization skills
In-depth market and domain knowledge
An innovative and creative approach to problem-solving
Excellent communication and presentation skills
What we look for:
If you are a performance-driven, inquisitive mind with the agility to adapt to ambiguity, you will fit right in. You should be eager to explore opportunities to build meaningful collaborations with stakeholders and aspire to create unique customer-centric solutions. Bias for action and a passion to conquer new frontiers in the AI space is at the heart of the Presight community.
What working at Presight offers:
Culture: An open, diverse and inclusive environment with a global vision that encourages personal growth and focuses on ground-breaking, industry-first innovations.
Career: Outstanding learning, development & growth opportunities via structured training programs and innovative, high-tech projects.
Rewards: A competitive remuneration package with a host of perks including healthcare, education support, leave benefits and more.
About G42
G42 is a global leader in creating visionary artificial intelligence for a better tomorrow. Born in Abu Dhabi and operating across the world, G42 champions AI as a powerful force for good. Its people are constantly reimagining what technology can do, applying advanced thinking and innovation to accelerate progress and tackle society’s most pressing problems.
G42 is driving change in the region and beyond, joining forces with nations, corporations and individuals to create the infrastructure for tomorrow’s world. From molecular medicine to space travel and everything in between, G42 realizes exponential possibilities, today.
To confidently demonstrate that you meet the criteria above, please contact us.