Principal Site Reliability Engineer

news/2025/4/2 5:07:19/

远程工作分享,非美国地区也可以申请,链接:https://www.linkedin.com/jobs/search/?currentJobId=3539602504。

我只是找工作的时候看到的,觉得还可以做个分享,不负责hire,不负责communicate,只是工作的搬运工。

这个职位一周前post的,相对较新,完整需求在下面。

About the job

About Bobsled

Our goal at Bobsled is to transform the way data is shared across organizations, clouds, and data platforms. Our cross-cloud platform enables enterprises to share data quickly and securely through one unified control plane that manages all aspects of data sharing, including replication, updates, versioning, entitlements, telemetry, and more.

By Solving These Problems We Will

  • Remove barriers to collaboration between organizations
  • Facilitate and democratize the use of data to enable better decision making

We believe that by using data collaboratively, we can enable better solutions to the world’s hardest problems.

The Role

We are looking for a Principal Site Reliability Engineer to support the operational excellence of Bobsled’s data sharing platform. You’ll apply your expertise to complex technical and business challenges and develop innovative solutions that meet requirements concerning functionality, performance, observability, scalability, and reliability. You will be part of the team designing and managing our platform, and your work will have an enormous impact on the way organizations use data across the world.

As an early hire, you will also play a pivotal role in building our team and culture, fostering a collaborative environment, and assessing engineering candidates.

Key Responsibilities

  • Be a creative thinker and problem solver and lead technical discussions to deliver on SRE responsibilities.
  • Design and build reliable pipelines for delivering features to production in a timely yet safe manner using modern techniques.
  • Design and implement logging, monitoring, observability capabilities as well as bespoke tools to manage Bobsled’s products and services running on global multi-cloud infrastructure.
  • Be instrumental in the design and implementation of Bobsled’s incident response process adhering to modern best practices.
  • Participate in on-call rotation and respond to issues that impact Bobsled availability, and provide support with customer incidents.
  • Participate in design discussions with other teams to promote SRE principles and ensure code delivered is of production quality.
  • Be aware of changes in software best practices and new technologies which Bobsled could adopt to improve our security posture, cost margins and feature velocity.

Preferred Qualifications

  • 8+ years experience as a senior/principal SRE or similar role responsible for managing distributed cloud systems in production.
  • Required to work with Typescript and Terraform (CDKTF), but experience in other modern languages will be considered.
  • Expert knowledge of monitoring principles and modern alerting techniques at scale and tooling required to deliver on these.
  • Good knowledge of credential/secret management which deliver modern best practices and to assist achieving security compliance certifications.
  • Good knowledge of infrastructure as code concepts and CI/CD pipelines.
  • Good knowledge of cloud infrastructure and provider databases. Serverless knowledge is a big plus.

Compensation

  • US Salary Range: $160-200K
  • Outside the US salaries are adjusted to account for differences in payroll taxes, cost of providing benefits, and FX costs
    We also offer competitive equity compensation

Benefits

  • Health Insurance (for US employees): Medical (100% paid), dental and vision benefits for you and your family
  • Generous PTO policy and paid parental leave
  • Fully upgraded Apple MacBook and 4K monitor (for engineering team only)
  • Home office stipend of $1,000
  • Flexible work hours in fully-remote work environment
  • Fully-sponsored individual coaching for all employees to help foster a culture of personal reflection and growth (optional though encouraged)

We understand that no candidate is perfectly qualified for any job. Experience comes in different forms; many skills are transferable; and passion goes a long way. Even more important than your resume is a clear demonstration of skill, dedication, and the ability to thrive in a fluid and collaborative environment. We want you to learn new things in this role. We’re hiring at multiple levels of seniority, so we encourage you to apply if your experience is close to what we’re looking for.

We are committed to fostering and empowering an inclusive community within our organization. We do not discriminate on the basis of race, religion, color, gender expression or identity, sexual orientation, national origin, citizenship, age, marital status, veteran status, disability status, or any other characteristic protected by law.


http://www.ppmy.cn/news/74312.html

相关文章

力扣LCP 33. 蓄水

LCP 33. 蓄水 给定 N 个无限容量且初始均空的水缸,每个水缸配有一个水桶用来打水,第 i 个水缸配备的水桶容量记作 bucket[i]。有以下两种操作: 升级水桶:选择任意一个水桶,使其容量增加为 bucket[i]1 蓄水&#xff1…

速率控制(RATE control, RC)原理简介

速率控制(RATE control, RC) ⚫️速率控制(RATE control, RC)是H265中用于控制传输速率的一种技术,简单来说,就是通过对量化参数QP和拉格朗日因子lambda的控制,使得视频的每秒压缩后的大小尽可…

TikTok掀动出海淘金潮

嘉晟迪科:在各行各业都已经卷成红海的今天,最稀缺的是什么?当然是增长。那么,增长在哪里?流量在哪里,需求就在哪里,增长也就在那里。 因为短视频风靡全球的流行,内容平台特别是短视频…

4.1 一级存储结构

本节介绍 GPU 上的一级缓存结构,重点介绍统一的 L1 数据缓存和暂存器“共享内存”,以及它们如何与计算核心交互。 我们还简要讨论了 L1 纹理缓存的典型微架构。 我们包括对纹理缓存的讨论,虽然它在 GPU 计算应用程序中的使用有限,…

12 IO1

File类中的常用方法有哪些? 1.String getName() :获取文件名称 2.String getPath():获取文件路径 3.String getAbsolutePath():获取绝对路径 4.File getParentFile():获取上级目录文件 5.boolean exists():判断文件是否存在…

05. 数据结构之队列

前言 队列(queue)是一种线性数据结构,队列中的元素只能先入先出(First In First Out,简称 FIFO)。队列和实际生活中的排队相对应,是一种和生活息息相关的数据结构,在很多系统中都会…

VMware ESXi 6.0 多网卡接入 多网段绑定 虚机接入不同网段

网卡要与对应网段的网络联通。不同的网卡接入不同网段的网络。要为vmware esxi 6 的多个虚机配置不同网段的ip地址,首先选择主机对应的网口分别插上处于在不同网段的网线。 配置管理网络 多个网口接入,只可以配置一个管理网络,就是只有一个网…

干货 | 利用SPSS进行高级统计分析第一期

Hello,大家好! 这里是壹脑云科研圈,我是喵君姐姐~ 你是否还在为分析实验数据而感到头疼?你是否还在苦于自己不知道如何选择合适的模型来分析数据? 本期我们就来为大家带来了利用SPSS软件进行高级统计分析…