Job Title:
Sr. Embedded Site Reliability Engineer (Samsung Ads)

Company: Samsung Electronics America

Location: New York City, NY

Created: 2024-04-23

Job Type: Full Time

Job Description:

Samsung Ads focuses on enabling brands to connect with Samsung audiences as they are exposed to digital media across all devices. Being part of an international company such as Samsung and doing business worldwide means that we get to work on big complex projects with stakeholders and teams located around the globe.Our purpose is to deliver unparalleled results for our customers. Using the Samsung Ads uniquely transforms the advertising landscape by using comprehensive data to build the world's most intelligent connected audience platform. We deliver on Samsung Electronics' 51-year commitment to excellence through smart, easy, effective advertising solutions to make advanced video advertising work.What you will doAs an embedded SRE you'll be partnering with different software development teams and act as a subject matter expert on all aspects of reliability, including usability, performance, cost, scale and observability.The ideal candidate has deep technical knowledge and a strong interest in process automation, data driven insights, software-defined infrastructure, and approaches it from the perspective of a software engineer. Challenges of globally distributed high volume/low latency services, deciding what and when state should be shared, and designing for failure scenarios should drive you.You will work with some incredibly talented and passionate developers with a solid technical background to bring products and services to a market with unique technical challenges.The following information provides an overview of the skills, qualities, and qualifications needed for this role.Key ResponsibilitiesCo-architect services for fault-tolerance, self-healing, and establish clear scaling pathsBe a subject matter expert for the challenges of infrastructure and operation within the teamTranslate Product Owner requirements into actionable technical tasksBe data-driven through observability and use the insights to propose reliability roadmap workContribute to the Samsung Ads global SRE practiceEmpower your development team with tooling and automation, including CI/CDContinuously improve internal services for ease of packaging, configuration, and deploymentOptimize all layers (hardware, software) for high service performanceContribute to runbooks, disaster recovery plans and run chaos engineering exercisesCo-own technical relationships with several service providers and vendorsEstablishing capacity and growth plans to be executed by automationRequired Skills And ExperienceStrong expertise in AWS and administrating and scaling Kubernetes native applications (CKA, CKAD, CKS are nice to have)Strong expertise in building and managing CI/CD pipelinesRelevant software engineering experience with at least one language (Go, Ruby, Python, Erlang or Java)Understanding of microservices, distributed systems and client-server architecturesStrong Linux system administration and troubleshooting skills, including knowledge of how the various components work (kernel, CPU, memory, disk, network)Experience with Infrastructure as Code tools like TerraformKnows how to make the most of an agile multi-team environmentBe resourceful, inventive, and passionate about technologyYou are eager to challenge the status quoDemonstrated ability to prioritize tasks and promptly resolve problemsProactive, addresses potential issues before they occurTrack record of making things better and leading solutions that remove technical pain points and facilitate growthExcellent communication skills in English is essential If you're interested in joining a rapidly growing team working to build an outstanding, world-class advertising organization with a relentless focus on design and customer experience, you've come to the right place.