September 28, 2023

Benjamin Better

Better Get Computer

Big data infrastructure internship | Adaltas

Big data infrastructure internship | Adaltas

Job description

Huge Information and dispersed computing are at the main of Adaltas. We accompagny our partners in the deployment, maintenance, and optimization of some of the premier clusters in France. Due to the fact not too long ago we also give support for working day-day operations.

As a wonderful defender and active contributor of open up resource, we are at the forefront of the knowledge system initiative TDP (TOSIT Info Platform).

In the course of this internship, you will lead to the advancement of TDP, its industrialization, and the integration of new open supply elements and new functionalities. You will be accompanied by the Alliage qualified staff in cost of TDP editor assist.

You will also function with the Kubernetes ecosystem and the automation of datalab deployments Onyxia, which we want to make offered to our shoppers as nicely as to pupils as section of our training modules (devops, huge knowledge, etc.).

Your qualifications will help to develop the products and services of Alliage’s open up resource support giving. Supported open supply elements consist of TDP, Onyxia, ScyllaDB, … For these who would like to do some web work in addition to big details, we previously have a really practical intranet (ticket management, time management, superior lookup, mentions and associated articles, …) but other awesome characteristics are anticipated.

You will follow GitOps release chains and compose article content.

You will get the job done in a crew with senior advisors as mentor.

Organization presentation

Adaltas is a consulting agency led by a group of open up resource gurus focusing on data management. We deploy and work the storage and computing infrastructures in collaboration with our buyers.

Lover with Cloudera and Databricks, we are also open up supply contributors. We invite you to look through our web site and our numerous complex publications to learn a lot more about the business.

Capabilities necessary and to be obtained

Automating the deployment of the Onyxia datalab calls for expertise of Kubernetes and Cloud indigenous. You ought to be at ease with the Kubernetes ecosystem, the Hadoop ecosystem, and the distributed computing product. You will learn how the standard parts (HDFS, YARN, item storage, Kerberos, OAuth, etcetera.) function together to meet up with the employs of big information.

A excellent knowledge of applying Linux and the command line is needed.

Throughout the internship, you will understand:

  • The Kubernetes/Hadoop ecosystem in get to lead to the TDP job
  • Securing clusters with Kerberos and SSL/TLS certificates
  • Large availability (HA) of solutions
  • The distribution of assets and workloads
  • Supervision of products and services and hosted apps
  • Fault tolerant Hadoop cluster with recoverability of lost knowledge on infrastructure failure
  • Infrastructure as Code (IaC) by means of DevOps resources these as Ansible and [Vagrant](/en/tag/hashicorp- vagrant/)
  • Be cozy with the architecture and operation of a knowledge lakehouse
  • Code collaboration with Git, Gitlab and Github


  • Turn into acquainted with the architecture and configuration strategies of the TDP distribution
  • Deploy and check protected and remarkably out there TDP clusters
  • Contribute to the TDP know-how foundation with troubleshooting guides, FAQs and content
  • Actively contribute ideas and code to make iterative enhancements to the TDP ecosystem
  • Study and analyze the differences in between the key Hadoop distributions
  • Update Adaltas Cloud working with Nikita
  • Contribute to the growth of a device to gather buyer logs and metrics on TDP and ScyllaDB
  • Actively contribute thoughts to produce our assist alternative

Supplemental information

  • Site: Boulogne Billancourt, France
  • Languages: French or English
  • Starting off date: March 2023
  • Length: 6 months

Substantially of the digital world runs on Open Source software program and the Large Knowledge field is booming. This internship is an opportunity to get precious encounter in the two domains. TDP is now the only really Open Source Hadoop distribution. This is a great momentum. As part of the TDP group, you will have the likelihood to discover one of the core major info processing styles and take part in the advancement and the foreseeable future roadmap of TDP. We think that this is an remarkable option and that on completion of the internship, you will be all set for a prosperous profession in Significant Knowledge.

Devices offered

A laptop with the subsequent traits:

  • 32GB RAM
  • 1TB SSD
  • 8c/16t CPU

A cluster made up of:

  • 3x 28c/56t Intel Xeon Scalable Gold 6132
  • 3x 192TB RAM DDR4 ECC 2666MHz
  • 3x 14 SSD 480GB SATA Intel S4500 6Gbps

A Kubernetes cluster and a Hadoop cluster.


  • Salary 1200 € / thirty day period
  • Cafe tickets
  • Transportation pass
  • Participation in a single intercontinental convention

In the past, the conferences which we attended incorporate the KubeCon arranged by the CNCF basis, the Open up Resource Summit from the Linux Foundation and the Fosdem.

For any request for supplemental info and to post your application, you should get in touch with David Worms: