Category: Generative AI

Building Fault-Tolerant AI Pipelines: Smarter Load Balancing and Workflow Orchestration

Apr 18, 2025

—

by

MJ

in Architecture, Generative AI

As AI systems become integral to real-time applications, engineering leaders face the challenge of keeping complex AI pipelines both responsive and resilient. In particular, orchestrating workflows that involve multiple AI models — often calling out to external large language model (LLM) APIs — requires careful design to ensure fault tolerance. This article explores the hurdles…
Prompt Engineering or Text Tinkering: What are we co-creating with GPT models?

Aug 29, 2023

—

by

MJ

in Generative AI

TLDR; I can’t wait until universities start offering degrees in “Prompt Engineering” (troll) — If that happens, I’ll know the GenAI hype cycle has reached the top. Kidding aside, it is important to think about our interactions with GenAI because in a few years these applications will be as ubiquitous and as much a part…