Yesterday in the Tech Town Hall, the only anonymous Q&A submission called out SRE for being in a silo, and I couldn't be more proud that someone besides me is seeing it and taking action.
System Administration
Week 11, Configuration Management II
In this video, we continue our discussion of configuration management systems. We talk about state assertion, what states of a host we might care about, the CAP theorem and other fallacies of distributed systems, idempotence, eventual consistency and convergence, and the overlap of CM systems with other infrastructure components, yielding, eventually, infrastructure as a service.
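To make idempotence and convergence a little more concrete, here is a minimal sketch in Python of a state assertion. It is not taken from the video or from any particular CM system; the helper name `ensure_line` and the file it manages are hypothetical. The function describes a desired state ("this line is present in this file") and only acts when the current state differs, so running it once or a hundred times leaves the host in the same state.

```python
import os


def ensure_line(path, line):
    """Assert that `line` is present in the file at `path`.

    Returns True if the file had to be changed, False if it was
    already in the desired state. (Hypothetical helper, for
    illustration only.)
    """
    lines = []
    if os.path.exists(path):
        with open(path, encoding="utf-8") as fh:
            lines = fh.read().splitlines()
    if line in lines:
        return False  # desired state already holds; change nothing
    lines.append(line)
    with open(path, "w", encoding="utf-8") as fh:
        fh.write("\n".join(lines) + "\n")
    return True
```

The first call on a host that lacks the line reports a change; every later call reports none. That "changed / unchanged" signal is what lets repeated runs converge on the desired state instead of appending the same line again and again.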
"So this has been tested?"... my a$$! (looking for a polite English translation for something) Follow me for more miseries of an #SRE
Celebrate the 25th anniversary of @lfnw by sharpening your #DevOps and #SRE skills with virtual talks from #Coroot CEO & Co-Founder Nikolay Sivko!
Join in for #Linux themed drinks, hikes, and stop by room DMC 109 Sat & Sun 9:30 - 10:15 to learn new #kubernetes and #observability tips: https://t.ly/Gv0YS
#Visualpath offers an industry-leading #SRE Certification Course to help you master tools like Prometheus, Grafana, and Ansible. Our SRE Online Training Institute in Chennai provides expert-led sessions, real-time projects, and hands-on learning for career growth. Join learners globally from the USA, UK, Canada, Dubai, and Australia. Call +91-7032290546 now to book your free demo session!
Visit: https://www.visualpath.in/online-site-reliability-engineering-training.html
WhatsApp: https://wa.me/c/917032290546
Visit Our Blog: https://visualpathblogs.com/category/site-reliability-engineering/
System Administration
Week 11, Configuration Management I
In this video, we illustrate the general evolution of the management of system configuration and then talk about defining services by abstracting individual requirements for system-specific and service-specific aspects. We present a few sample snippets of Puppet, Chef, and CFEngine code to give you a taste of some common CM systems.
How do you effectively monitor a multi-layered system?
Ana shares how a @ferretdb team (a great #opensource #MongoDB alternative!) saved time debugging and reduced overhead costs by transforming manual telemetry analysis into automated, instant insights: https://t.ly/j6PVi
Want to grow your open source career? The #LiFTScholarship offers FREE training & certification for #DevOps, #SRE, #SysAdmins & more!
Apply by April 30: https://app.smarterselect.com/programs/102338-Linux-Foundation-Education
@ChrisLAS @ironicbadger really sad to hear about the #selfhosted #podcast reaching #EOL, I've been with you since the single-digit episodes, was an #SRE supporter then Jupiter.party, it was SelfHosted that brought me to #JupiterBroadcasting all those years ago.
Will be really sad to see it go, the cadence was great, and you two made wonderful hosts.
Sorry about the #AdWinter, afraid that is what is doing in so many JB shows like SH and Coder Radio.
System Administration
Week 10, Backups by example
In this video, we illustrate how to perform backups using tar(1) (overcoming xkcd/1168), dump(8) and restore(8), and rsync(1), both locally and to a remote system.
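As a rough illustration of the tar-style local backup, the following sketch uses Python's standard `tarfile` module rather than the tar(1) command line shown in the video. The function names `create_backup` and `list_backup` are assumptions made for this example; the result is approximately what a gzip-compressed `tar` archive of a directory would contain.

```python
import os
import tarfile


def create_backup(src_dir, archive_path):
    """Write a gzip-compressed tar archive of `src_dir` to `archive_path`.

    Members are stored relative to the directory's own name, so the
    archive unpacks into a single top-level directory.
    """
    top = os.path.basename(os.path.normpath(src_dir))
    with tarfile.open(archive_path, "w:gz") as tar:
        tar.add(src_dir, arcname=top)
    return archive_path


def list_backup(archive_path):
    """Return the member names stored in the archive, for verification."""
    with tarfile.open(archive_path, "r:gz") as tar:
        return tar.getnames()
```

Listing the archive right after creating it is a cheap sanity check, though it is no substitute for an actual test restore.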
On Friday, which is typically payday for weekly-wage workers, there was some kind of outage that prevented #HomePay (from Care[dot]com) from paying out salaries to domestic workers like nannies, maids, babysitters, etc. They subsequently put a message on their website login screen, but for most of the day many had no clarity on when the funds would be disbursed to the workers. Customer care had long wait times due to this issue too. Since funds are typically collected on the Wednesday before, the money was already gone from the accounts of the families who employed them. They eventually sent out communication indicating the delayed payroll would be paid out on Monday.
I’m surprised that such a major payroll platform had a payout outage and there was no news coverage I could find on the subject. I’m really interested in understanding what technical issues caused this problem. Also, what banking service is HomePay using?
Three major types of #DNS failures are timeouts, latency, and DNS NXDOMAIN errors. Observability tools can often overlook these areas, so do they matter? Well, it depends on how many services you would like to break.
Assuming your answer is ‘none’ – head over to our blog to learn how to master effective DNS observability and keep your system running failure-free:
#Kubernetes is more than a containerized app platform: it can provide better database management and automation for your business without vendor lock-in.
Learn how you can set up a #postgreSQL kubernetes cluster and manage anomalies to keep your organization’s databases running smoothly and cost-effectively in the #cloudnative world:
this week I'm reading Human Factors in Systems Engineering
there are so many gems I've highlighted already but really vibed with how the author clearly and simply expressed the impact of writing docs "early" here
Are you looking for a new remote job? Browse 400+ remote positions from open source companies including @acquia @grafana @mozilla @wikimediafoundation and more on #OSJH
https://opensourcejobhub.com/jobs/?q=remote&utm_source=mosjh
#career #OpenSource #engineer #sales #security #marketing #CloudNative #developer #DevSecOps #SRE #FOSS