DevOps

fromZDNET
3 days ago

I found the best Linux server distros for your home lab

I've had several incarnations of the self-hosted home lab for decades. At one point, I had a small server farm of various machines that were either too old to serve as desktops or that people simply no longer wanted. I'd grab those machines, install Linux on them, and use them for various server purposes. Here are two questions you should ask yourself:
DevOps
fromAnarc
5 days ago

net-tools to iproute cheat sheet

Also note that I often alias ip to ip -br -c as it provides a much prettier output. Compare, before:

    anarcat@angela:~> ip a
    1: lo: <LOOPBACK,UP,LOWER_UP> mtu 65536 qdisc noqueue state UNKNOWN group default qlen 1000
        link/loopback 00:00:00:00:00:00 brd 00:00:00:00:00:00
        inet 127.0.0.1/8 scope host lo
           valid_lft forever preferred_lft forever
        inet6 ::1/128 scope host noprefixroute
           valid_lft forever preferred_lft forever
    2: wlan0: <NO-CARRIER,BROADCAST,MULTICAST,UP> mtu 1500 qdisc noqueue state DOWN group default
DevOps
#kubernetes
fromInfoQ
1 month ago
DevOps

Pinterest's Moka: How Kubernetes Is Rewriting the Rules of Big Data Processing

fromDevOps.com
1 week ago

Gas Town: What Kubernetes for AI Coding Agents Actually Looks Like - DevOps.com

Steve Yegge thinks he has the answer. The veteran engineer - 40+ years at Amazon, Google and Sourcegraph - spent the second half of 2025 building Gas Town, an open-source orchestration system that coordinates 20 to 30 Claude Code instances working in parallel on the same codebase. He describes it as "Kubernetes for AI coding agents." The comparison isn't just marketing. It's architecturally accurate.
DevOps
DevOps
fromTheregister
1 week ago

Final step to put new website into production deleted it

A well-scripted, tested deployment can still fail when an operator deviates from documented steps, causing outages and undermining careful planning.
DevOps
fromAnarc
1 week ago

Kernel-only network configuration on Linux

The Linux kernel ip= boot parameter configures network interfaces at boot without userland tools, working across distributions and dating to early kernels.
#immutable-linux
fromZDNET
1 week ago
DevOps

Atomic vs immutable Linux: How to decide which distro type is right for you

fromZDNET
2 weeks ago
DevOps

Linux updates don't scare me anymore thanks to this immutable distro (and it's easy to use)

DevOps
fromApp Developer Magazine
1 year ago

OpenShift 4.21 launches with unified platform for AI and modern apps

OpenShift 4.21 unifies AI training, containerized microservices, and virtualized applications under one operational model, adds intelligent GPU allocation, scaling-to-zero, and enhanced virtualization features.
#amazon-ecs
fromInfoQ
2 months ago
DevOps

AWS Launches ECS Express Mode to Simplify Containerised Application Deployment

fromNew Relic
1 week ago

5 Best Application Performance Monitoring Tools to Consider in 2026

Support for distributed systems: Check how well the tool handles microservices, serverless, and Kubernetes. Can you follow a request across services, queues, and third-party APIs? Does it understand pods, nodes, clusters, and autoscaling events, or does it treat everything like a static host?
Correlation across metrics, logs, and traces: In an incident, you shouldn't be copying IDs between tools. Look for the ability to pivot directly from a slow trace to relevant logs,
DevOps
DevOps
fromLogRocket Blog
1 week ago

Fortifying your stack with Cloudflare: A security playbook - LogRocket Blog

Do not treat edge providers as infallible; design architectures that define clear responsibilities and tolerate edge degradations to preserve availability and security.
DevOps
fromNew Relic
2 weeks ago

Goodbye to False Silences: Automating Reliable NRQL Alerts at Scale

Configure Signal Loss and Gap Filling and automate NRQL alert updates to prevent false silences and maintain reliable telemetry-based alerting at scale.
fromZDNET
2 weeks ago

Want to self-host for free? This server OS makes it easy - here's how to get started

Because of that, you need to be very familiar and comfortable with the command line. Or you can install a desktop environment. In my opinion, this is the single easiest way to make Ubuntu Server easier, especially if you're relatively new to Linux. Having a GUI desktop will strip away the fear of having to use the command line, because you'll have plenty of apps to use (such as the file manager, user manager, GUI app store, and much more).
DevOps
fromTheregister
2 weeks ago

Counting the waves of tech industry BS from blockchain to AI

They called out about half a dozen particular instances of what they considered to be bullshit technology. We were too busy laughing sympathetically to whip out a laptop to make notes, but as best as we can recall the sequence, they were:
Containers
Kubernetes
The "Cloud"
Anything at all "as a Service"
The Blockchain - anything, everything, based on it
And now, arguably the biggest and worst of all, "generative AI"
DevOps
#devops
fromInfoWorld
1 month ago
DevOps

The hidden devops crisis that AI workloads are about to expose

DevOps must move from component-level testing and basic monitoring to platform-level, end-to-end instrumentation and observability to meet AI data volume, velocity, and resilience needs.
fromInfoQ
2 months ago
DevOps

DevOps Is for Product Engineers, Too

DevOps should be practiced to achieve sociotechnical excellence by jointly optimizing social and technical systems across product and platform engineering.
fromDbmaestro
4 years ago

Database Delivery Automation in the Multi-Cloud World

The main advantage of going the Multi-Cloud way is that organizations can "put their eggs in different baskets" and be more versatile in their approach to how they do things. For example, they can mix it up and opt for a cloud-based Platform-as-a-Service (PaaS) solution when it comes to the database, while going the Software-as-a-Service (SaaS) route for their application endeavors.
DevOps
DevOps
fromAmazon Web Services
2 weeks ago

Building a scalable code modernization solution with AWS Transform custom | Amazon Web Services

An open-source infrastructure enables enterprise-scale, parallel AWS Transform custom code modernizations using AWS Batch, Fargate, REST APIs, and CloudWatch monitoring.
DevOps
fromMedium
4 weeks ago

Kubernetes Component statusz - When Your Cluster Finally Learns to Talk!

Component Statusz (KEP 4827) adds in-process, detailed component diagnostics to Kubernetes, improving cluster observability and simplifying debugging of internal component state.
fromInfoWorld
2 weeks ago

The 'Super Bowl' standard: Architecting distributed systems for massive concurrency

When I manage infrastructure for major events (whether it is the Olympics, a Premier League match or a season finale) I am dealing with a "thundering herd" problem that few systems ever face. Millions of users log in, browse and hit "play" within the same three-minute window. But this challenge isn't unique to media. It is the same nightmare that keeps e-commerce CTOs awake before Black Friday or financial systems architects up during a market crash. The fundamental problem is always the same: How do you survive when demand exceeds capacity by an order of magnitude?
DevOps
DevOps
fromTechzine Global
2 weeks ago

Red Hat OpenShift 4.21 brings smart GPU allocation for AI workloads

OpenShift 4.21 introduces Dynamic Resource Allocation for GPUs, autoscaling-to-zero hosted control planes, and cross-cluster live VM migration to optimize AI workloads and costs.
DevOps
fromLogRocket Blog
2 weeks ago

Dokploy vs Coolify: Why Dokploy wins in production - LogRocket Blog

PaaS offerings simplify deployment and scaling but introduce unpredictable costs and vendor lock-in, motivating self-hosted PaaS for greater control and predictable pricing.
#observability
fromInfoQ
3 weeks ago
DevOps

Uber Gets Ready for AI in Network Observability with Cloud Native Overhaul

fromInfoQ
3 weeks ago
DevOps

Railway Highlights the Importance of Logs, Metrics, Traces, and Alerts for Diagnosing System Failure

#kubernetes-135
#clickhouse
fromInfoQ
3 weeks ago

OpenEverest: Open Source Platform for Database Automation

Percona recently announced OpenEverest, an open-source platform for automated database provisioning and management that supports multiple database technologies. Launched initially as Percona Everest, OpenEverest can be hosted on any Kubernetes infrastructure, in the cloud, or on-premises. The main goal of the project is to avoid vendor lock-in while still providing an automated private DBaaS. Built on top of Kubernetes operators, it aims to avoid complex deployments that depend on a single cloud provider's technology.
DevOps
fromNew Relic
3 weeks ago

Preventing network outages: How we use New Relic to monitor our multi-cloud infrastructure

Running a global observability platform means one thing above all: your infrastructure must never go down. When you're responsible for monitoring thousands of customers' applications 24/7, network failures aren't just inconvenient, they're existential threats. At New Relic, hundreds of clusters run across multiple clouds and regions. These clusters depend on a complex web of network connections: regional transit gateways, inter-regional hubs, and cross-cloud links.
DevOps
DevOps
fromZDNET
3 weeks ago

7 open-source apps I'd happily pay for - because they're that good

Many high-quality open-source applications exist across Linux, macOS, and Windows; some are indispensable enough that users would willingly pay for them.
fromZDNET
3 weeks ago

Need to manage virtual machines on Linux? I found an easier way

I recently wrote about my migration away from VirtualBox to KVM/Virt-Machine for my virtual machine needs. I've found those tools to be far superior to VirtualBox (albeit with a bit more of a learning curve). Since then, however, I've found another method of working with KVM (the Linux kernel virtual machine technology), one that allows me to create and manage virtual machines not only on my local computer, but also from any machine on my LAN. That tool is Cockpit, which makes managing your Linux machines considerably easier.
DevOps
fromZDNET
3 weeks ago

The only Linux command you need for monitoring network traffic - and how to use it

Linux has a tool for everything. Sometimes those tools come in the form of an easy-to-use GUI, and other times a command is necessary. For monitoring network traffic, your best bet is the command line. Once you dive down the rabbit hole of possible commands for this task, you could become overwhelmed with choices -- and with the complexity of some of those commands.
DevOps
DevOps
fromInfoWorld
4 weeks ago

12 principles for improving devsecops

Apply SaaS-derived devsecops principles—shift-left practices, expanded test automation, and SLO-driven observability—to deliver reliable, performant, and secure enterprise applications.
fromInfoWorld
4 weeks ago

Stop treating force multiplication as a side gig. Make it intentional

Lead without authority. You may not have direct reports, yet you shape architecture, quality and the roadmap. Your leverage comes from artifacts, reviews and clear standards, not from title. I started by publishing a lightweight architecture template and a rollout checklist that the team could copy. That reduced ambiguity during design and cut review cycles by nearly 30 percent
DevOps
DevOps
fromInfoQ
1 month ago

OpenCost Looks Back on 2025 Milestones and Charts a Roadmap for 2026

OpenCost expanded cost visibility and automation in 2025 with 11 releases, an AI-ready MCP server, improved multi-cloud tracking, enhanced usability, and stronger community contributions.
fromMedium
1 month ago

Securing Microservice Communication with Istio and Envoy Sidecars

As organizations increasingly adopt cloud-native architectures, managing communication between microservices becomes a critical challenge. Modern applications are often distributed across multiple Kubernetes pods and ensuring secure, reliable and observable interactions between these services is essential. This is where Istio and Envoy sidecars come into play. Together they form a service mesh solution that abstracts networking complexities, enforces security policies and provides deep observability - all without requiring changes to application code.
DevOps
DevOps
fromTechzine Global
1 month ago

Chainguard expands EmeritOSS with ten new projects

Chainguard's EmeritOSS assumes maintenance for ten mature open-source projects, providing dependency updates, builds, and releases to ensure continued reliability.
DevOps
fromInfoQ
1 month ago

Salesforce Migrates 1,000+ EKS Clusters to Karpenter to Improve Scaling Speed and Efficiency

Migrating 1,000+ EKS clusters to Karpenter reduced scaling latency, simplified operations, lowered costs, and enabled more flexible self-service infrastructure for developers.
DevOps
fromTechzine Global
1 month ago

Culture, not code, is the biggest challenge for Kubernetes

Cloud native technologies are widely adopted, but further growth depends on overcoming cultural resistance within organizations rather than technical limitations.
fromInfoQ
1 month ago

Human-Centred AI for SRE: Multi-Agent Incident Response without Losing Control

Hakboian describes a pattern in which specialised agents (one for logs, one for metrics, one for runbooks, and so on) are coordinated by a supervisor layer that decides who works on what and in what order. The aim, the author explains, is to reduce the cognitive load on the engineer by proposing hypotheses, drafting queries, and curating relevant context, rather than replacing the human entirely.
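
The supervisor pattern described in this summary could be sketched roughly as follows. This is only an illustration under stated assumptions, not the author's implementation: the three agent functions, the `Supervisor` class, and the fixed ordering are all hypothetical, and the agents return text proposals for a human to review rather than taking any action.

```python
from typing import Callable, Dict, List, Tuple

# Hypothetical agent type: takes an incident description, returns a proposal.
Agent = Callable[[str], str]


def logs_agent(incident: str) -> str:
    return f"Draft log query for: {incident}"


def metrics_agent(incident: str) -> str:
    return f"Suggested metrics to inspect for: {incident}"


def runbooks_agent(incident: str) -> str:
    return f"Candidate runbook sections for: {incident}"


class Supervisor:
    """Decides which agents run and in what order (here: a fixed list)."""

    def __init__(self, agents: Dict[str, Agent], order: List[str]) -> None:
        self.agents = agents
        self.order = order

    def handle(self, incident: str) -> List[Tuple[str, str]]:
        # Collect proposals only; the engineer stays in control of what to do.
        return [
            (name, self.agents[name](incident))
            for name in self.order
            if name in self.agents
        ]


if __name__ == "__main__":
    supervisor = Supervisor(
        {"logs": logs_agent, "metrics": metrics_agent, "runbooks": runbooks_agent},
        order=["metrics", "logs", "runbooks"],
    )
    for name, proposal in supervisor.handle("checkout latency spike"):
        print(f"[{name}] {proposal}")
```

A real supervisor would presumably choose the order per incident; the fixed list here only keeps the sketch short.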
DevOps
DevOps
fromInfoQ
1 month ago

Pulumi Adds Native Support for Terraform and HCL

Pulumi now natively supports HashiCorp Terraform and OpenTofu, executing HCL and hosting Terraform state to enable mixed-tool infrastructure and migration.
DevOps
fromInfoWorld
1 month ago

How Ansible does the real work in hyperautomation

Hyperautomation combines RPA, IaC, AI/ML, NLP, intelligent workflows and process mining, with Ansible executing infrastructure and configuration changes across environments.
DevOps
fromAmazon Web Services
1 month ago

From AI agent prototype to product: Lessons from building AWS DevOps Agent | Amazon Web Services

AWS DevOps Agent employs a lead-and-sub-agent architecture to provide accurate, performant incident response and root-cause analysis for native AWS applications.
DevOps
fromInfoQ
1 month ago

Platform-as-a-Product: Declarative Infrastructure for Developer Velocity

A unified configuration layer centralizes application and infrastructure intent, simplifying developer workflows while enabling FinOps validation, consistent deployments, and platform-aligned visibility and compliance.
DevOps
fromTechzine Global
1 month ago

What Microsoft Azure Local can and cannot do

Azure Local delivers Azure cloud functionality on-premises, using Hyper-V/Stack HCI, validated server hardware, and Azure Portal management for gradual hybrid migration.
#linux
fromMedium
3 months ago
DevOps

What is swap memory in linux? What It Really Is, Why It Exists, and How to Actually Use It

DevOps
fromInfoWorld
1 month ago

From distributed monolith to composable architecture on AWS: A modern approach to scalable software

Migrating distributed monoliths to a composable AWS architecture yields loosely coupled, autonomous services that improve scalability, resilience, deployment velocity, and team autonomy.
DevOps
fromMedium
3 months ago

Unified Observability Through Open Standards and Distributed Tracing

Unified observability requires open standards and distributed tracing (e.g., OpenTelemetry) to correlate logs, metrics, and traces across distributed cloud-native systems.
#docker
fromMedium
3 months ago
DevOps

Mastering Docker Daemon Configuration on Linux: systemd, Sockets, TLS & daemon.json Explained

#docker-compose
DevOps
fromInfoQ
1 month ago

Cloudflare Scales Infrastructure as Code with Shift-Left Security Practices

Infrastructure-as-Code with mandatory peer review and automated policy enforcement prevents configuration incidents, increases velocity, and catches security violations before deployment across hundreds of production accounts.
DevOps
fromTheregister
1 month ago

Microsoft euthanizes ancient deployment toolkit

Microsoft has immediately retired Microsoft Deployment Toolkit (MDT), ending updates, patches, and support and urging migration to Autopilot or Configuration Manager OSD.
DevOps
fromMedium
1 month ago

Who's Spotting You When You Automate

Temporal awareness in ITSM approval automation builds trust by providing past, present, and future visibility so automation and humans can share judgement safely.
fromInfoQ
1 month ago

Fast Eventual Consistency: Inside Corrosion, the Distributed System Powering Fly.io

What do we do at Fly? We are a developer-focused cloud platform. That means we make it easy for developers to get their apps deployed, up and running. Something I think that really differentiates us is that we make it easy to deploy your apps in different regions over the world. We are available in 40 different regions. It's basically like a CDN, but for your apps.
DevOps
DevOps
fromStephane's Blog
1 month ago

Automating TLS Certificate Monitoring with GitHub Actions, certificate_watcher, and Slack

Combine certificate_watcher with a weekly GitHub Actions workflow and Slack notifications to monitor SSL/TLS certificate expirations serverlessly using a Git-hosted hosts list.
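
The expiry check at the heart of such a workflow can be sketched with Python's standard library alone. This is a minimal sketch, not certificate_watcher's actual implementation: the helper names `days_until_expiry`, `fetch_not_after`, and `expiring_soon`, and the 30-day threshold, are illustrative assumptions.

```python
import socket
import ssl
import time


def days_until_expiry(not_after: str, now: float) -> float:
    """Days between `now` (epoch seconds) and a certificate's notAfter string."""
    # ssl.cert_time_to_seconds parses the "Jan  5 09:34:43 2018 GMT" format
    # that SSLSocket.getpeercert() reports.
    return (ssl.cert_time_to_seconds(not_after) - now) / 86400


def fetch_not_after(host: str, port: int = 443, timeout: float = 10.0) -> str:
    """Connect to host:port and return the peer certificate's notAfter field."""
    context = ssl.create_default_context()
    with socket.create_connection((host, port), timeout=timeout) as sock:
        with context.wrap_socket(sock, server_hostname=host) as tls:
            return tls.getpeercert()["notAfter"]


def expiring_soon(hosts, threshold_days=30, now=None, fetch=fetch_not_after):
    """Return (host, days_left) pairs for certificates within the threshold."""
    now = time.time() if now is None else now
    results = []
    for host in hosts:
        days_left = days_until_expiry(fetch(host), now)
        if days_left <= threshold_days:
            results.append((host, days_left))
    return results
```

A scheduled job could call `expiring_soon` on the hosts list and post any results to a chat webhook; the `fetch` parameter exists so the check can be exercised without network access.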
fromComputerworld
1 month ago

5 areas of ITSM being transformed by automation in 2026

Automation is transforming IT service management (ITSM), moving service desks from reactive, manual workflows toward systems that can intelligently route, prioritize, and resolve issues with minimal human intervention. Recent research from Freshworks found that IT professionals lose nearly seven hours every week, almost a full workday, to fragmented tools and overly complicated work processes. Implementing ITSM automation reduces manual effort, accelerates resolution, improves consistency and accuracy, enables proactive issue prevention, and delivers faster, more reliable service that measurably improves employee and end-user satisfaction.
DevOps
DevOps
fromInfoQ
1 month ago

Docker Kanvas Challenges Helm and Kustomize for Kubernetes Dominance

Docker Kanvas enables developers to convert local Docker Compose setups into production-ready Kubernetes deployments with automated cloud provisioning and Infrastructure-as-Code generation.
DevOps
fromInfoQ
1 month ago

Slack Enhances Chef Infrastructure to Improve Safety and Reduce Blast Radius in Deployments

Slack reduced deployment risk by splitting the Chef production environment into availability‑zone tied buckets and using Chef Summoner for staggered, artifact‑triggered runs.
fromPythonbytes
1 month ago

Malicious Package? No Build For You!

Charlie Marsh announced the Beta release of ty on Dec 16, "designed as an alternative to tools like mypy, Pyright, and Pylance." Extremely fast even from first run. Successive runs are incremental, only rerunning necessary computations as a user edits a file or function. This allows live updates.
DevOps
#swap
fromMedium
3 months ago
DevOps

What is swap memory in linux? What It Really Is, Why It Exists, and How to Actually Use It

DevOps
fromInfoQ
1 month ago

From Confusion to Clarity: Advanced Observability Strategies for Media Workflows at Netflix

One hour-long Netflix episode encoding generates millions of trace spans, thousands of microservice calls, hundreds of media encodes, and over 100,000 CPU hours.
fromMedium
3 months ago

Unified Observability Through Open Standards and Distributed Tracing

Unified observability means having one consistent view of all the parts of a distributed system: applications, infrastructure, and services. With modern systems spread over several clouds, microservices, and APIs, the only way to get such visibility is through open standards and distributed tracing. Standards such as OpenTelemetry make it easier both to collect and to correlate the telemetry data (logs, metrics, and traces) from various components.
DevOps
DevOps
fromInfoQ
1 month ago

AWS Announces New Amazon EKS Capabilities to Simplify Workload Orchestration

Amazon EKS Capabilities provides fully managed, Kubernetes-native features (Argo CD, ACK, KRO) to simplify orchestration, AWS resource management, and automation on EKS.
DevOps
fromFast Company
1 month ago

Software resilience testing is more critical than ever

Many companies lack resilience testing, leaving systems vulnerable to cascading outages that cause massive financial, operational, and reputational damage; resilience testing limits such risks.
fromInfoWorld
1 month ago

2026: The year we stop trusting any single cloud

For more than a decade, many considered cloud outages a theoretical risk, something to address on a whiteboard and then quietly deprioritize during cost cuts. In 2025, this risk became real. A major Google Cloud outage in June caused hours-long disruptions to popular consumer and enterprise services, with ripple effects into providers that depend on Google's infrastructure. Microsoft 365 and Outlook also faced code failures and notable outages, as did collaboration platforms like Slack and Zoom. Even security platforms and enterprise backbones suffered extended downtime.
DevOps
fromMedium
2 years ago

Navigating Through the Storm

When a system is overwhelmed with more requests than it can effectively process, a cascade of problems can ensue, significantly undermining its performance and reliability. One of the most immediate and noticeable consequences is the degradation of performance. In such scenarios, users may face frustratingly slow response times or complete timeouts in more severe cases. This not only hampers the user experience but can also erode trust in the system's reliability.
DevOps
DevOps
fromZDNET
1 month ago

How I ditched Google Photos for my own private self-hosted alternative - for free

Immich provides a free, self-hosted Google Photos–like service that requires Docker and can be installed on local Linux, macOS, or Windows machines.
DevOps
fromInfoQ
1 month ago

Docker Makes Hardened Images Free in Container Security Shift

Docker released over 1,000 hardened container images under Apache 2.0, providing secure, non-root, minimal base images with SBOMs and SLSA provenance for all developers.
fromInfoQ
1 month ago

How Authress Designed for Resilience and Survived a Major AWS Outage

Identity and authentication services company Authress shared its strategy to stay operational during major cloud infrastructure outages like the massive October 2025 AWS outage that disrupted many major services. The company's resilience architecture relies on strategies like multi-region deployment and minimizing reliance on AWS control plane services, Authress CTO Warren Parad explains. Parad says the AWS October 20 incident was the worst seen in a decade. Even so, Authress maintained its SLA reliability commitments thanks to a reliability-first design centered on a failover routing strategy.
DevOps
#docker-daemon
fromMedium
3 months ago
DevOps

Mastering Docker Daemon Configuration on Linux: systemd, Sockets, TLS & daemon.json Explained

fromAzure DevOps Blog
2 months ago

The New Test Run Hub is Going Generally Available! - Azure DevOps Blog

Real-Time Visibility: Instantly monitor test progress and quality trends to catch regressions before they impact your release.
Comprehensive Analytics: Dive into historical data with built-in dashboards that break down results by outcome, priority, configuration, and failure type.
Effortless Management: Use powerful filters such as timeline, run type, pipeline, and more, to find exactly what you need. Customize your view with persistent search and column visibility settings.
DevOps
DevOps
fromLondon Business News | Londonlovesbusiness.com
2 months ago

Beyond the migration: Why cloud strategy consulting is the architect of modern business growth - London Business News | Londonlovesbusiness.com

Nearly 30% of cloud spend is wasted due to poor architecture and planning; cloud strategy consulting is essential to align IT with business outcomes.