
Linux Infrastructure Senior Specialist
- Botafogo - RJ
- Permanente
- Meio período
Administering Linux environments (RHEL, CentOS, Ubuntu and Suse), ensuring high availability and performanceSupporting the infrastructure that serves AI and Machine Learning workloads, including configuring environments with GPU support (NVIDIA, CUDA)Working with advanced troubleshooting, performance analysis and second and third level technical supportCollaborate with global teams, participating in meetings and technical deliveries in EnglishCarry out root cause analysis of incidentsProducing documentation related to the environment (RCA, KBs and others)Acting on incidents relating to their specialty in order to standardize the environmentActing on requests relating to their specialty in order to meet customer demandsPlanning changes to implement corrections, migrations and improvementsImplement corrections, migrations and improvements based on a change planCollecting information on managed environments to meet internal demandsProducing and updating documentation related to the client's production environmentCarrying out the operation, maintenance and documentation of customer equipment with a management product (Managed Services) in accordance with the policyIdentifying opportunities for improvement and new business opportunitiesCarrying out careful analysis of the production environment in order to propose improvementsProviding telephone support to clients for communications related to Incidents, Requests and the preparation of changesSupporting first-level service teams with technical guidance or escalating demandsParticipating in the planning and implementation of HPC solutions, software integration and system administration for new clientsQualificationsPrevious experience in Linux system administration in mission-critical environments with experience in HPC or AIExperience with AI workload environments, including storage integration and troubleshooting (SAN, etc.)Experience with NGFW appliances from manufacturers such as Fortinet, Juniper, Cisco, Checkpoint and SophosAdvanced English (oral and written communication)Experience with Schedulers such as SLURM, LSF, PBS, etc.Knowledge of Kubernetes and/or DockerN2 and N3 service experience, working cross-countryExperience in tshooting connectivity incidents using testing toolsAvailability of working hoursDifferentialsCertifications such as RHCSA, RHCE, LFCS, Docker Certified Associate and CKA.Knowledge of FortigateKnowledge of CUDAKnowledge of JuniperOS , EMC OS , Cumulus OSKnowledge of Docker and KubernetesInfiniband MELLANOXNVDIA DGX ( A100, H100, GB200)NVIDIA Base Command Manager (v10, v11)Equinix is an equal opportunity employer. All candidates will be considered for employment regardless of race, color, religion, creed, national or ethnic origin, ancestry, place of birth, citizenship, sex, pregnancy/childbirth or related medical conditions, sexual orientation, gender identity or expression, marital status or partnership, age, veteran or military status, physical or mental disability, medical condition, information, political/organizational affiliation, status as a victim or family member of a victim of crime or, or any other status protected by applicable law.Equinix is committed to ensuring that our employment process is open to all individuals, including those with a disability. If you are a qualified candidate and need assistance or an accommodation, please let us know by completing .Equinix is an Equal Employment Opportunity and, in the U.S., an Affirmative Action employer. All qualified applicants will receive consideration for employment without regard to unlawful consideration of race, color, religion, creed, national or ethnic origin, ancestry, place of birth, citizenship, sex, pregnancy / childbirth or related medical conditions, sexual orientation, gender identity or expression, marital or domestic partnership status, age, veteran or military status, physical or mental disability, medical condition, genetic information, political / organizational affiliation, status as a victim or family member of a victim of crime or abuse, or any other status protected by applicable law.