Senior Site Reliability Engineer Jobs - Remote Work From Home & Flexible
-
28 days agoWe're looking for aSenior Site Reliability Engineerto improvereliabilityand stability of customer-facing, production infrastructure, serving millions of page views per hour. Our product is used by over 2 million users world-wide across 190...
-
15 days agoWe are looking for an exceptionalSenior Site Reliability Engineerto join our growing team. If you're looking for a real challenge in terms of mission criticality, multi-geographic region deployments, diversity of managed services, and the chance...
-
23 days ago这是一个完美的人是p的机会assionate about automating away classic systemreliabilityissues, coming up with creative solutions to scale impact. We are looking for someone who has an aversion to the throw more people at the on-...
-
New!3 days agoDeliver a migration tooling orchestrator that has a huge impact on the product scalability,reliabilityand cost. Operate the Search Products. Run and improve our homemade Edge Load balancer. Experience building and operating distributed systems at scale.
-
30+ days agoDesign and implement reliable and scalable systems using softwareengineering最佳实践。开发和部署自动化和monitoring tools to proactively detect and mitigate incidents, and to prevent outages. Partner withengineersacross...
-
FeaturedNew!4 days agoBe part of our team responsible for designing, writing, and delivering software to improve the availability, scalability, latency, and efficiency of services. Work with a team of software and systemsengineerson projects for users responsible for...
-
3 weeks agoOursite reliability engineersbring Python software-engineeringskills and rigour to the operations domain. Architect and run OpenStack, Kubernetes and software defined storage. SoftwareEngineeringor Computer Science degree.
-
15 days agoWill lead the PaaS (Platform as a Service) team ofSite Reliability Engineersresponsible for ensuring thereliability, availability, and scalability of multiple services which have an impact on all the company's products.
-
FeaturedNew!YesterdayAs theSite Reliability Engineer, you will play a key role in designing, developing, and maintaining reliable, scalable, and highly available infrastructure for our API services. You will contribute heavily to the high-impact challenges behind...
-
30+ days agoWork with modern EKS infrastructure and deployment tools like fluxcd and argocd. Support hosted database platforms like Mongo Atlas. Mentor other team members within your areas of subject matter expertise, to avoid creating knowledge silos. Build...
-
16 days agoBe a creative thinker and problem solver and lead technical discussions to deliver on SRE responsibilities. Design and build reliable pipelines for delivering features to production in a timely yet safe manner using modern techniques. Design and...
-
23 days agoPair-programming to collaboratively improve the services that power us. Establishing SLIs and SLOs for the key customer workflows that your team owns. Diagnosing the factors that most threaten SLOs and identifying necessary improvements. Improving...
-
30+ days agoBe on an on-call rotation to respond to incidents that impact availability and drive the efforts to provide service restoration within SLAs. Conduct neutral postmortems of issues and events to identify Root Cause. Use your on-call shift schedule to...
-
30+ days agoChampion and implement a culture of SRE to maintain a high-quality platform infrastructure. Champion and implement application and infrastructure monitoring and alerting to prevent client impacting issues by ensuring system availability and performance.
-
27 days agoDesign, build, and maintain core MTA infrastructure pieces that allow company scaling to support real-time processing and delivery of billions of messages. Experience in managing and working with MTAs including MTA administration as a postmaster.
-
28 days agoDesign, build, and maintain core MTA infrastructure pieces that allow scaling to support real-time processing and delivery of billions of messages. Plan the growth of MTA infrastructure. Automate the deployment process to make it as boring as possible.
-
27 days agoDesign and develop the CI/CD systems developers use and the infrastructure for all current and future websites and services. Diagnose and debug production incidents and then improve systems to prevent the problem from recurring. Collaborate with web...
-
27 days agoDesign and develop the CI/CD systems developers use and the infrastructure for all current and future websites and services. Diagnose and debug production incidents and then improve systems to prevent the problem from recurring. Collaborate with web...
-
Featured30+ days agoDesign, implementation and maintenance of public facing infrastructure and services. Use of configuration management and deployment tools. Architectural design and operation at scale. Monitoring of systems and services, optimization of performance and...
-
12 days agoDesign and build out our cloud infrastructure (we run everything in AWS). Participate in software and system performance analysis, tuning, and service capacity planning. 5+ years of experience in operating high-traffic SaaS environments. Deep expertise...
-
30+ days agoImprove our software delivery pipeline in a way that makes it expedient and encourages a culture of high code quality. Work with business units scale and design systems that are highly available and resilient.Make substantial code contributions to apps.
-
30+ days agoDefine and operationalize service level objectives (SLO) and find sustainable methods for monitoring, managing, and scaling our platforms and services. Distill and synthesize non-functional requirements into discreet and meaningful iterations that can...
-
13 days agoHave strong experience in Linux systems, networking, containers, and troubleshooting. Have enough development experience to write scripts, automation, or lightweight programs or submit patches to the product codebase. Participate in on-call rotations...
-
30+ days agoYou will be an integral part of designing and operating large-scale highly available distributed systems in the cloud. Collaborate with our application development teams to ensure thereliabilityand performance of our infrastructure.Write quality code.
-
8 days agoWork as part of a highly agile team, taking part in the full spectrum of team activities including planning, design, implementation, validation, and retrospection. Become familiar with the Identity Platform, working together withengineering...
-
New!5 days agoYou will be working on a small team using cutting edge technology and tools to build and support infrastructure for our diverse environment including customer facing applications, large scale data processing, and APIs.
-
30+ days agoYou will work with developers to create more scalable services and help us build self-service paved roads to simplify writing services and provisioning infrastructure. You will help to isolate, trap, and respond from the inevitability of system failure...
-
22天前This role will provide support to development teams using development tools like Ansible, Jenkins and GIT for Java applications. Facility planning, storage systems, server systems, website and web applications, LAN, and other IT related systems functions.
-
New!3 days agoDesign, create and expand monitoring and visibility systems to ensure the stability of our team's services. Analyze, and review internal applications to design and implement improvements. Manage infrastructure using best practices of infrastructure...
-
30+ days ago5+ years of combined experience in SRE/DevOps or Software Development roles in a full stackengineeringenvironment. Experience soliciting systems requirements, designing, and implementing new platform components leveraging infrastructure or SaaS...
-
New!TodayBe responsible for managing the entire SRE function at the company. The SREs are responsible for keeping all user-facing services and other company production systems running smoothly. The Director, along with the SREs, are a blend of pragmatic...
-
30+ days agoBe responsible for automating every operational task is a core requirement for Environment Automation SRE. E.g. package updates, configuration changes across all customer platforms without interruptions, tools for automatic provisioning of customer...
-
30+ days agoResearch: stay abreast of the latest trends in related products, fabrication techniques, vendors, quality assurance, and processes. Work on Industrial design and Architecture: evaluate system feature tradeoffs and create the system specifications for,,,
-
3 weeks agoThisseniorrole will be responsible for initiating creativeengineeringsolutions to production process issues through the modification and improvement of existing equipment and sub components, or the design and development of new products.
-
26 days agoTheSeniorSystemsEngineeris responsible for provisioning, configuration, operation, and maintenance of on-premises and cloud-based systems and related infrastructure. You will participate in technical research and development to drive innovation...
-
FeaturedNew!5 days agoBuilding scalable data pipeline infrastructure, libraries and processes using Airflow, Spark, Flink, Kafka. Implementing data quality monitoring that alerts the team of, possible data issues. 5+ years of relevant industry experience.
-
8 days agoHelp architect and deliver the software components, automation infrastructure to enable ourengineeringteams to configure, test, and deploy software quickly, easily and independently. You will help select and bring in best-of-breed tooling as well as...
-
26 days agoUse your passion for testing software, firmware, and hardware to build a solution that stresses our embedded systems and helps identify failure points to improve the design,reliabilityand performance of our products. Test and verify features of the...
-
16 days agoDesign technical implementation and build features as defined by the product team. Write code (principally Typescript) to run in a variety of cloud environments and data warehouses. For more information on how we're using Typescript see this talk on...
-
26 days agoScale and improve our rendering infrastructure to deliver world-class performance to publishedsites. Contribute to rearchitecting our coresitedata layer to provide horizontal scaling forsites. Identify and implement key instrumentation...
-
23 days agoDrive and be a key contributor to the design, development, and ultimately operation of the discovery engine system at scale. Be responsible of the quality, soundness of the system. Work with other teams to identify, troubleshoot, and resolve high...
-
30+ days agoProvide architectural, design and threat-based guidance to software development teams to improve the security posture before code is written. Assess, design, implement, automate, and document security solutions and processes for securing K8s, Private...
-
13 days agoResponsible for designing, implementing and deploying core platform infrastructure and services across all environments, managing incidents as they occur and performing root cause analyses, and writing quality, clean code in Terraform and Chef.
-
26 days agoThe Cyber Policies team is responsible for developing and sustaining a cyber policy platform that provides high integrity for issuance of cyber lines. We focus on meeting corporate KPIs for cyber-based revenue and exceedingengineering...
-
8 days agoParticipate in software development processes and activities with the team. Work cross functionally to refine requirements, plan, estimate, and deliver impactful software. Invest in our developer experience through faster tests, better builds, and...
-
30+ days agoSenior engineerneeded for a full-time, freelance, remote position responsible for writing and verifying code, building applications, troubleshooting and resolving issues, overseeing and supporting teams. Five+ years' development experience required.
-
2 weeks agoBe a part of a development team, quickly understand the product, the architecture and come up with a plan to test/validate functionality, and identify and target weaknesses. Be a part of the development process, collaborating with theengineeringand...
-
30+ days agoSetup of high availability of hosted Production and UAT environments for company products. Responsible for driving the server and computing architecture, vulnerability management and patching environments. Bachelor's degree in Mathematics, Physics and...
-
1 week agoDevelop and ship the first smart contract implementation that operates on the company chain. Define compatibility with and support for existing smart contract execution tool chains and implementations such as EVM and Solidity. Collaborate with other...
-
1 week agoDesign and build petabyte-scale observability platform for allengineeringteams to consume. Develop and improve instrumentation for monitoring and logging the health and availability of services. You have 5+ years of relevant software development...