Md Bazlur Rahman Likhon | Top Gen AI & Cloud Engineering Blog

AI engineering notes for teams building real systems.

Production-minded writeups on LLM apps, RAG, agents, cloud architecture, and delivery tradeoffs. Less theory, more field-tested patterns we can actually ship.

1 Visible Articles

3 Topics Covered

Latest Current View

Practical AI and cloud playbooks, not recycled thought pieces.

Use the filters to jump between delivery topics, architecture patterns, and implementation notes. The newest article is featured first so the page always feels current.

Filtering by topic: Semantic Search

Search by title, excerpt, or article body content.

Sorted newest to oldest so the freshest article stays on top.

Start with the latest article

This lead story reflects your current filter settings.

Latest LLM Ops Jan 20, 2026

RAG Cost Optimization: Cutting $4.12 to $1.11 Per 1,000 Queries Without Sacrificing Recall

This post breaks down how we reduced real-world RAG system costs from $4.12 to $1.11 per 1,000 queries”without sacrificing recall or latency. Based on optimizations deployed across 11 enterprise pipelines handling 40M+...

Read article See implementation examples

Catalog

All available articles

Browse deeper for architecture notes, implementation patterns, and delivery lessons.

This view contains one matching article.

The featured story above is the only article that matches your current filters. Clear the filters to explore more of the archive.

Browse all articles

Turn Reading Into Delivery

Need hands-on help after reading?

Move from article ideas to implementation with direct support on architecture, delivery planning, and production execution.

Explore service tracks

Browse AI engineering, cloud architecture, security, and delivery offerings in one place.

Estimate project cost

Compare engagement models before you scope an implementation or advisory project.

Review shipped work

See project examples that connect these blog ideas to real delivery outcomes.