ML-focused Site Reliability Engineer

2 weeks ago


Bucharest, Bucureşti, Romania Adobe Full time

Our Company

Changing the world through digital experiences is what Adobe's all about. We give everyone—from emerging artists to global brands—everything they need to design and deliver exceptional digital experiences We're passionate about empowering people to create beautiful and powerful images, videos, and apps, and transform how companies interact with customers across every screen. 

We're on a mission to hire the very best and are committed to creating exceptional employee experiences where everyone is respected and has access to equal opportunity. We realize that new ideas can come from everywhere in the organization, and we know the next big idea could be yours

The Opportunity

We have a fantastic opportunity for a ML-focused Site Reliability Engineer to join our Developer Platforms team based in Bucharest.

We are looking for an engineer with hands-on experience in machine learning, including designing and training models for real-world applications. The ideal candidate will play a crucial role in developing and implementing anomaly detection systems to proactively spot and address operational issues in intricate infrastructures. This role demands a strong understanding of AI Ops methodologies to optimize performance, automate incident response, and enhance system reliability. Candidates should be enthusiastic about using data to drive intelligent automation and improve service resilience at scale.

The Role:

  • Build outstanding things that matter. You'll work on a critical growth initiative, solving problems for engineers and customers.

  • Grow. Sharpen your skills, use innovative technology, and collaborate with your peers.

  • Collaborate. Work in an environment that values collaboration.

What you'll do:

  • Ensure the highest level of uptime and Quality of Service (QoS) to Adobe's customers through operational excellence

  • Architect and build an AI Anomaly-detection system that works on Adobe's observability data at scale, partnering with other teams to work across boundaries.

  • Define service level objectives (SLOs) and service level indicators (SLIs) to represent and measure service quality

  • Identify areas to improve service resiliency through techniques such as chaos engineering, performance/load testing, anomaly detection, etc

  • Support and maintain globally distributed multi-cloud (public and/or private) environments

  • Automate common, repeatable tasks at a large scale to reduce toil

  • Tackle performance and stability issues using a wide variety of tools

  • Participate in an on-call rotation as required

  • Determine the root cause for all production level incidents and write corresponding high-quality RCA reports

What you'll need to succeed:

  • Hands-on experience with AI anomaly detection and training models

  • Expert in MCP integration, with experience in MCP to MCP communication as a nice to have.

  • Understanding of how to fine-tune signals from observability systems to allow our AI capabilities to scale for Production data.

  • Deep understanding of both software engineering and technical operations

  • DevOps skills (scrum/Kanban/agile/ci-cd/12-factor)

  • Experience in modern cloud-based, SaaS delivery technologies: AWS, Azure, Jenkins, Git, Atlassian Jira and Confluence, Linux, DNS, E-mail, containers, log analysis, monitoring, Java, Apache, Tomcat, Memcached, Qpid, and MySQL on Linux, Prometheus, Grafana, New Relic, Splunk.

  • Expertise with containerization orchestration engines (Kubernetes)

  • Programming skills, particularly with Python, Java, and Ruby

  • Applied skills in machine learning

  • Excellent communication, interpersonal, and teamwork skills

  • Familiar with a variety of cloud and automation concepts, practices, and procedures

Adobe is proud to be an Equal Employment Opportunity employer. We do not discriminate based on gender, race or color, ethnicity or national origin, age, disability, religion, sexual orientation, gender identity or expression, veteran status, or any other applicable characteristics protected by law. Learn more.

Adobe aims to make accessible to any and all users. If you have a disability or special need that requires accommodation to navigate our website or complete the application process, email  or call



  • Bucharest, Bucureşti, Romania Vodafone Full time

    Your day to day:The Site Reliability Engineer, as a part of the Product Development team, is responsible to ensure the availability and performance of our application portfolio, toolset, and platform, as well as helping to drive improvements and ehancements at scale while automating processes linked with building and deploying to AWS Cloud.Also the SR...


  • Bucharest, Bucureşti, Romania nShift Full time

    About UsnShiftis the leading global provider of cloud delivery management solutions (SaaS), we enable the frictionless shipment and return of almost one billion shipments across 190 countries each year. We are headquartered in London and Oslo and have over 500 employees across offices in Sweden, Finland, Norway, Denmark, the United Kingdom, Poland, the...

  • ML Ops Engineer

    2 weeks ago


    Bucharest, Bucureşti, Romania Data Edge Full time

    Contract Type: Freelance SRL or PFA/RemoteContracting period: 6 monthsRole OverviewWe are looking for a skilledMLOps Engineerto join our data and machine learning initiatives. In this role, you will be responsible for deploying, operating, and scaling machine learning models in production environments, ensuring reliability, performance, and seamless...


  • Bucharest, Bucureşti, Romania Salt Bank Full time

    We are looking for an exceptional Site Reliability Engineer who lives and breathes system reliability atSalt Bank– one of the fastest-growing neobanks revolutionizing the Romanian financial services landscape. About Us: AtSalt Bank, we're not just another fintech company. We're building the future of banking, aiming to serve millions of customers who...

  • ML Ops Engineer

    2 weeks ago


    Bucharest, Bucureşti, Romania Shape Your Future with Us Full time

    Contract Type: Freelance SRL or PFA/RemoteContracting period: 6 monthsRole Overview We are looking for a skilled MLOps Engineer to join our data and machine learning initiatives. In this role, you will be responsible for deploying, operating, and scaling machine learning models in production environments, ensuring reliability, performance, and seamless...


  • Bucharest, Bucureşti, Romania Worldline Global Full time

     Bucharest, RomaniaThis is Worldline.Worldline helps businesses of all shapes and sizes to accelerate their growth journey - quickly, simply, and securely. We are the innovators at the heart of the payments technology industry, shaping how the world pays and gets paid. Our technology powers the growth of millions of businesses across 5 continents. And just...


  • Bucharest, Bucureşti, Romania Worldline Global Full time

     Bucharest, RomaniaThis is Worldline.Worldline helps businesses of all shapes and sizes to accelerate their growth journey - quickly, simply, and securely. We are the innovators at the heart of the payments technology industry, shaping how the world pays and gets paid. Our technology powers the growth of millions of businesses across 5 continents. And just...


  • Bucharest, Bucureşti, Romania Thales Full time

    Location: Bucharest, RomaniaThales is a global technology leader trusted by governments, institutions, and enterprises to tackle their most demanding challenges. From quantum applications and artificial intelligence to cybersecurity and 6G innovation, our solutions empower critical decisions rooted in human intelligence. Operating at the forefront of defence...


  • Bucharest, Bucureşti, Romania ING Hubs Romania Full time

    Discover ING Hubs RomaniaING Hubs Romania offers 130 services in software development, data management, non-financial risk & compliance, audit, and retail operations to 24 ING units worldwide, with the help of 𝐨𝐯𝐞𝐫 𝟐𝟎𝟎𝟎 𝐡𝐢𝐠𝐡-𝐩𝐞𝐫𝐟𝐨𝐫𝐦𝐢𝐧𝐠 𝐞𝐧𝐠𝐢𝐧𝐞𝐞𝐫𝐬, 𝐫𝐢𝐬𝐤,...


  • Bucharest, Bucureşti, Romania Brightgrove Full time

    ABOUT THE CLIENTThis client delivers digital and AI-driven solutions specifically for the life sciences and healthcare industries.The company provides end-to-end engineering, informatics, and data science services. Our customer utilizes scientific and technical expertise to build scalable, secure, and compliant digital solutions.PROJECT DETAILSThis position...