I wasn’t originally planning on a write up of SREcon24 Americas, but some ideas bubbled in my head, along with some themes, and there were a ton of solid talks (the speaker line up was wall-to-wall heavy hitters). I’m also curious how I’ll feel about my talk in the future and wanted to capture that to look back on. And… Read more →
Justifying Resilience Work
How do we justify the need to make our organizations more resilient? Quick Primer: in this context, a shorthand for resilience is adapting to unexpected changes in an environment in an attempt to return to stability. It’s not robustness, like an automatic fail-over of a system without human intervention (removing a server from a pool behind a load balancer for… Read more →
Blame Awareness is Universal
Blame awareness only works if you work towards blame awareness with all incidents, not just the ones that affect yo Read more →
On Lightsabers and Resilience
Main Takeaway: Adaptability can only come from expertise, and expertise is developed only through experience. Quite often, that’s failure and even the expectation of failure (gamedays, chaos engineering experiments, architecture reviews, etc.). Also, lightsabers are cool. I’m surprised it’s taken me this long to write about lightsabers. During my lunch breaks I’ve been diving into some Tested.com videos, in particular… Read more →
The Invisible Success of Near Misses
We often talk about blame aware culture. Your teams are continuously working towards building a system where, among many goals, a safe and reliable system is available. When we’re surprised, incidents happen. As we’re working towards safety, and by definition these incidents are surprises, shaming folks for failure is counterproductive and instead we should celebrate the opportunity to learn more.… Read more →
The Dynamic Nature of Company Culture
Company culture can’t be understated. A big portion of what we do revolves around building exciting projects, tackling hard problems, and working with talented folks in our industry. These often constitute high pressure situations that necessitate fostering bonds between coworkers. Some folks want to clock in, sit down and work for eight hours, and then clock out without having interacted… Read more →
Creating Safety is Dangerous Work
Imagine you’re heading down a road – biking, driving in your car, taking a walk – and you notice a trash can roll into the street. You quickly realize that any large obstruction rolling around is a potential hazard, for you and others. It’s not your trash can. It’s not your street. In fact, it’s likely that you can safely… Read more →
Emoji as Incident Resolution Tools
Note: This post is cross posted on Jeli’s blog When we think of the bands of communication that help facilitate our incidents (text chat, phone bridges, video calls), we tend to discount smaller cues, the inbetween that colors and adds depth. It may be on those video calls the notable pause when considering a problem, the tone of voice that… Read more →
Tech Interview Questions: Typing “google.com” in a browser
I’m putting together a series on common questions candidates face in technical screens. My hope is to give folks a leg up on how they can prepare, and secondarily any screeners better understand what they’re asking and how to improve for a better signal.No question (or answer) is perfect, but that shouldn’t stop us from exploring them. Feedback and constructive… Read more →
Connecting with Recruiters, pt 2: External Recruiters
External recruiters, also referred to as third party, contingency, or agency recruiters, differ from internal recruiters as they work for a placement firm rather than a tech company directly. They still want well qualified candidates to put in front of an interview panel, but lacking the direct integration, they instead maintain a network of companies that are looking for candidates.… Read more →