Being a System Architect in the Age of AI: Tools Change, But the
How is the artificial intelligence revolution affecting system architecture? With 20 years of experience, I evaluate AI's promises and the unchanging.
32 posts found.
How is the artificial intelligence revolution affecting system architecture? With 20 years of experience, I evaluate AI's promises and the unchanging.
Understanding PostgreSQL's MVCC mechanism is critical for performance and data consistency. Common mistakes and their solutions when developing applications...
We examine 3 common misconceptions in push notification delivery and the issues they cause in real-world systems. Improving reliability...
I examine the technical reasons behind mobile push notification delivery issues with my 20 years of system architecture experience. Problems, solutions, and...
Mustafa Erbay shares his experiences on the importance, usage, and practical tips for metric and trace data to deeply understand system issues…
Regularly rotating secrets in systems is a critical security step. Drawing from my own experiences, I'll discuss secret rotation strategies and practical...
How does metric cardinality affect system performance? In this guide, we delve deep into overlooked burdens and developer mistakes.
A deep dive into REST, GraphQL, and gRPC API design approaches. I compare them with concrete examples to help you choose the best fit for your project.
We examine methods for reducing APK and IPA packages, R8/ProGuard settings, and CI/CD processes in mobile app size optimization.
I explain the intricacies of LLM inference caching and what to consider when balancing cost and latency, with practical examples.
Exploring the differences, benefits, and real-world applications of storing system and application logs in structured (structured) or unstructured.
A deep dive into the real-world risks of agent tool usage and why these risks are often overlooked, based on Mustafa Erbay's experiences...
A detailed look at the Out-of-Memory (OOM) Killer incidents I experienced on my VPS, the intricacies of system memory management, and the silent deaths caused.
I explain how I solved duplicate records and token waste issues in AI content generation processes using idempotency principles.
A first-hand account of the SQLite concurrency and lockout problems I faced in the islistesi.com project, with the solution steps and lessons learned.
We dig deep into the complex operational challenges, hidden dangers and potential dead ends of distributed lock mechanisms.
Want to understand the hidden swap trap on Linux systems and learn memory management strategies for high-performance systems? Detailed…
Disaster recovery tests aren't only about technology. In this post we dive into the human factor and processes that decide DR plan success...
Environment Variables play a vital role in application configuration. But mismanaging them can leak hidden secrets and…
An in-depth look at the long-term costs and risks created by a simple 'hardcoding' decision in system architecture.
A deep look at the risks the eventual consistency model brings to distributed systems, and how to prevent critical data loss like missing orders.
How do firewall rule dependencies in production turn network management into a tangled nightmare? I walk through the real challenges and the strategies…
In a world where we keep pushing the limits of automation, what is the cost of losing the human factor? Technology and the future from an old engineer's…
Take an in-depth look at the invisible network disasters caused by DNS resolution failures and the impact this critical issue has on businesses.
The critical security and operational risks that expiring certificates cause in production environments, why they slip through the cracks, and effective…
Discover the hidden impact of reverse proxy buffer settings on performance and security. Optimization tips and tricks on the Mustafa Erbay blog!
Learn why time synchronization is critical in distributed systems and how to detect and resolve the elusive 'phantom bugs' it can cause.
The source of those unnoticed performance problems on your VMware ESXi cluster might just be Storage I/O Control. A detailed look and optimization advice.
Discover the hidden network dependencies that quietly bring production systems down. This article walks through the causes, symptoms, and prevention…
The operational crises I keep running into when I manage cloud infrastructure with GitOps — and the patterns that have helped me avoid the worst of them.
Discover the causes and types of model drift in Edge AI systems, plus how to handle the problem with automated rollback mechanisms.
Explore the hidden traps and possible failure modes inside the auto-renewal process of certificates that are vital to digital security. Don't let your security…