<?xml version="1.0" encoding="utf-8" standalone="yes"?><rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom"><channel><title>Ai on Hrushikesh Dokala</title><link>https://hrushikesh.dev/tags/ai/</link><description>Recent content in Ai on Hrushikesh Dokala</description><generator>Hugo</generator><language>en-us</language><lastBuildDate>Sat, 30 May 2026 00:00:00 +0000</lastBuildDate><atom:link href="https://hrushikesh.dev/tags/ai/index.xml" rel="self" type="application/rss+xml"/><item><title>context switching is all you need</title><link>https://hrushikesh.dev/notes/context-switching/</link><pubDate>Sat, 30 May 2026 00:00:00 +0000</pubDate><guid>https://hrushikesh.dev/notes/context-switching/</guid><description>&lt;p>i have 4+ terminals open right now. each one is running claude code (in bypass permissions). each one is working on a different project.&lt;/p>
&lt;p>2 of them are running experiments in independent worktrees, and the other 2 which im working on closely to debug and fix a few prod issues.&lt;/p>
&lt;p>i&amp;rsquo;m not doing any of this. i&amp;rsquo;m just&amp;hellip; orchestrating.&lt;/p>
&lt;h2 class="heading" id="the-old-advice">
 the old advice
 &lt;a class="anchor" href="#the-old-advice">#&lt;/a>
&lt;/h2>
&lt;p>there&amp;rsquo;s a &lt;a href="https://justinwelsh.me/essays/context-switching">popular take&lt;/a> that calls context switching &amp;ldquo;the silent killer of productivity.&amp;rdquo; the advice: batch your tasks, use do-not-disturb, single-task your way to clarity.&lt;/p></description></item><item><title>is ai making us dumber?</title><link>https://hrushikesh.dev/notes/ai-vs-brain/</link><pubDate>Wed, 26 Nov 2025 00:00:00 +0000</pubDate><guid>https://hrushikesh.dev/notes/ai-vs-brain/</guid><description>&lt;p>something which i&amp;rsquo;m observing in myself is:&lt;/p>
&lt;ul>
&lt;li>when i try to read, chat with ai to understand a concept at a deeper level, why does it fade away much faster than what i&amp;rsquo;ve just written down and put some conscious effort into understanding?&lt;/li>
&lt;li>it’s easy to understand things with ai, but is it easy to forget as well?&lt;/li>
&lt;li>do you think faster learning without cognitive effort will result in losing the learning from memory/understanding?&lt;/li>
&lt;/ul>
&lt;p>are people jumping into answers and thinking less?&lt;/p></description></item><item><title>moe - mixture of experts</title><link>https://hrushikesh.dev/notes/moe/</link><pubDate>Mon, 17 Nov 2025 00:00:00 +0000</pubDate><guid>https://hrushikesh.dev/notes/moe/</guid><description>&lt;p>hey, i was trying out the composer 1 in cursor, and was trying to understand how is inference super fast and even the code is very accurate at the same time, and how much does it cost (interms of gpus usage) and how did they scale to all cursor users.&lt;/p>
&lt;p>so i started digging into a few articles and papers, got my hands on a few open source models like ibm-granite/granite-3.1 and mixtral 8x7b which are sparse mixture of experts models. but then, i didnt understand how does it work, was wondering what is 8x7b? is it 8 times better than 7b? or 8x the size of 7b? well, the model has 8 experts, each with 7 billion parameters mistral style model. what the hell are these experts?&lt;/p></description></item><item><title>how to almost reduce llm costs by 80%</title><link>https://hrushikesh.dev/notes/llm-costs/</link><pubDate>Sat, 01 Jun 2024 00:00:00 +0000</pubDate><guid>https://hrushikesh.dev/notes/llm-costs/</guid><description>&lt;p>If you are an AI developer or building a SAAS wrapped around AI foundational models, its really necessary to think about spending a lot of money on LLMs without any prior knowledge on how to reduce these costs..&lt;/p>
&lt;p>











&lt;figure class="">

 &lt;div class="img-container" >
 &lt;img loading="lazy" alt="routerbench" src="https://hrushikesh.dev/images/routerbench.png" >
 &lt;/div>

 
&lt;/figure>
&lt;/p>
&lt;h3 class="heading" id="change-the-model">
 change the model
 &lt;a class="anchor" href="#change-the-model">#&lt;/a>
&lt;/h3>
&lt;p>By replacing the LLM like GPT-4 with a small language models like phi-3 or Mistral for specific tasks that doesn&amp;rsquo;t need more precise and optimized responses. this way you can have major cost cuttings.&lt;/p></description></item></channel></rss>