
Blogs, Ideas, Train of Thoughts


Publisher: Unclaimed!

Message frequency: 0 / week

Message History

Under the Hood of Rerankers: Scoring, Models, and Trade-Offs (8 days ago)

In my earlier post, "Understanding Re-Rankers: The Key to Smarter Search Results", we explored the basics of rerankers: what they are, why they matter, and how they turn...


Understanding Re-Rankers: The Key to Smarter Search Results (3 months ago)

Imagine searching for the “best CrossFit shoes” on a search engine. The initial results will bring up hundreds of options — some highly relevant, others not so much. Somewhere in the mix you’ll see the perfect training shoes, but you might also see casual sneakers, hiking boots, or even sandals.

This is where a reranker comes in. ...


LLMs, Token Limits, and Handling Concurrent Requests (3 months ago)

Large Language Models (LLMs) like GPT-4, Claude, or Gemini are powerful tools, but they don’t run with infinite capacity. Just as your laptop has CPU and memory limits, LLMs exposed via APIs have token limits and throughput constraints you need to design around. ...


Understanding the P95/P99 Latency Principle: Why the Slowest Requests Matter Most (4 months ago)

When we talk about system performance, the first number people usually quote is the average response time.

For example: “Our API responds in 200ms on average.”

In system performance, the average response time is a common benchmark. Yet, this metric hides a critical truth: users don’t encounter an average — t
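To make the gap between an average and a tail percentile concrete, here is a minimal sketch. It is not taken from the post: the sample latencies are invented so that the mean matches the quoted 200 ms, and the `percentile` helper is a hypothetical nearest-rank implementation rather than any particular library's method.

```python
import math

def percentile(samples, p):
    """Nearest-rank percentile: the smallest sample such that at least
    p percent of all samples are less than or equal to it."""
    if not samples:
        raise ValueError("samples must not be empty")
    ordered = sorted(samples)
    rank = math.ceil(p * len(ordered) / 100)  # 1-based rank
    return ordered[max(rank, 1) - 1]

# Hypothetical data: 90 fast requests (100 ms) and 10 slow ones (1100 ms).
latencies_ms = [100] * 90 + [1100] * 10

mean_ms = sum(latencies_ms) / len(latencies_ms)
p50_ms = percentile(latencies_ms, 50)
p95_ms = percentile(latencies_ms, 95)

print(f"mean={mean_ms} ms, p50={p50_ms} ms, p95={p95_ms} ms")
```

With this made-up distribution the mean is 200 ms, yet one request in ten takes 1100 ms, which only the P95 figure reveals.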


Little’s Law and Concurrency: Why Your System Gets Slow When It’s Busy (4 months ago)

When you walk into a coffee shop, you notice something:

If the server is quick, customers don’t wait long.

If the line moves slowly, the shop gets crowded — even if the number of customers arriving stays the same.

This simple idea is captured by Little’s Law, a principle from queuing the...
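As a rough illustration of the coffee-shop observation, Little's Law states that the average number of items in a system equals the arrival rate multiplied by the average time each item spends in the system (L = λW). The sketch below is an assumption-laden example, not code from the post; the function name and the numbers are hypothetical.

```python
def items_in_system(arrival_rate_per_s, avg_time_in_system_s):
    """Little's Law: L = lambda * W (average occupancy of a stable system)."""
    return arrival_rate_per_s * avg_time_in_system_s

# Same arrival rate in both cases (hypothetical: 10 requests per second).
arrival_rate = 10.0

fast = items_in_system(arrival_rate, 0.5)  # each request takes 0.5 s
slow = items_in_system(arrival_rate, 2.0)  # each request takes 2.0 s

print(f"fast service: {fast} in flight, slow service: {slow} in flight")
```

Under these assumed numbers, quadrupling the time per request quadruples the number of requests in flight (5 versus 20) even though arrivals have not changed, which is the crowding effect the excerpt describes.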

