Act as a Cloudera Data Engineer with high skill on performance software development using spark and java/scala languages.
Design, planning and management spark applications, based on Cloudera cluster technologies and workflows integrated with Kafka streams. Ensuring correct implementation and quality of code.
Support in the analysis of functional requirements to bring them to detailed technical specifications.
Technology that you will work with: KUDU, HDFS, Hive, Impala, Spark, Java, OPEN JDK, Springboot, Openshift, Kafka, Gitlab, Gitlab CI, ArgoCD, Linux servers, SQL Server and much more
What You Bring
Experience in Cloudera ecosystem. Cloudera manager, Ranger, Knox, Atlas, hive…
Experience with: Spark with streaming applications (writing in Scala/Java) using Kafka Streams. Kudu and HDFS data design, Impala queries.
experience with: XML, JSON, ...
Ready to Apply?
This position is 100% remote. Please mention Remote-Jobs.Work when applying.