2021 University Graduate - SRE Engineer

DESPRE COMPANIE

At Adobe, we’re changing the world. How? We give people the tools to bring their ideas to life and create content that makes life more fun and work more meaningful. We give businesses and organizations the power to truly engage their customers. We're the ones behind the gorgeously designed content that streams across your laptop, TV, phone, and tablet every day—and we’re the ones who harness the massive power of big data to help companies move from data to insight and insight to action by delivering content that people crave most.

We’re a company that understands that product innovation comes…

Site-ul companiei

DE LA ACEEAȘI COMPANIE

Toate stagiile de la Adobe Romania

2021 University Graduate - SRE Engineer

Stagiu plătit la Adobe Romania · Începe după sesiune

Categorii:

– Business Software Development
– Networking
– Software technologies in Automatic Control, Industrial And Systems Engineering

Oraș:

București

Aptitudini necesare:

computer science c++

Site Reliability Engineering has an exciting and challenging mission: Build, deploy, operate, scale and maintain company-wide platforms (PlaaS) for customer facing Adobe SaaS solutions. While various development groups focus on building our platforms, SRE provides operational/engineering support for both the platform as well as the product teams that leverage the platforms. A capable site reliability engineer (SRE) should have one main high-level objective; identify and solve complex problems through software. This is not a traditional sysadmin/operations role (ie deployments, ticket work, dashboarding, monitoring, incident response). A significant portion of time (~50%) will be some form of programming/development work, preferably to solve self-identified problems. This role will work with the various Adobe product engineering teams and will report to the Engineering Manager of Site Reliability Engineering group.

Areas of Responsibility:

Ensure the highest level of uptime and Quality of Service (QoS) to Adobe’s customers through operational excellence
Define service level objectives (SLOs) and service level indicators (SLIs) to represent and measure service quality
Embed with product teams (physically and/or virtually) to foster strong collaboration/partnership
Identify areas to improve service resiliency through techniques such as chaos engineering, performance/load testing, etc
Support and maintain globally distributed, multi-cloud (public and/or private) environments
Automate common, repeatable tasks at large scale to streamline operational procedures
Design and maintain production monitoring systems
Troubleshoot performance and stability issues using a wide variety of tools
Evaluate and manage application and environment security
Follow change management processes during implementations
Use and maintain version control for application infrastructure
Work in a diverse and global team environment
Cross-train with other global team members
Participate in an on-call rotation as required
Determine root-cause for all production level incidents and write corresponding high-quality RCA reports
Promote the DevOps/SRE mindset

What you will bring:

Experience with distributed applications at scale in public cloud (AWS and/or Azure)
Experience in one (and preferably more) of the following languages: C, C++, Java, Python, Go, Perl or Ruby
Expertise with containerization orchestration engines (ie Kubernetes, Mesos)
Working knowledge of modern, continuous development techniques and pipelines (Agile, Kanban, CI/CD, Jenkins, Git, Artifactory)
Experience working within software development or Internet-related industries, particularly in the context of a SaaS offering.
B.S. degree in Computer Science or related technical field