all but RAG done in Input chapter

souzatharsis · Dec 28, 2024 · 568a08f
1 parent 405cb84
Showing 53 changed files with 7,936 additions and 934 deletions.
diff --git a/tamingllms/_build/.doctrees/environment.pickle b/tamingllms/_build/.doctrees/environment.pickle
diff --git a/tamingllms/_build/.doctrees/markdown/preface.doctree b/tamingllms/_build/.doctrees/markdown/preface.doctree
diff --git a/tamingllms/_build/.doctrees/markdown/toc.doctree b/tamingllms/_build/.doctrees/markdown/toc.doctree
diff --git a/tamingllms/_build/.doctrees/notebooks/alignment.doctree b/tamingllms/_build/.doctrees/notebooks/alignment.doctree
diff --git a/tamingllms/_build/.doctrees/notebooks/cost.doctree b/tamingllms/_build/.doctrees/notebooks/cost.doctree
diff --git a/tamingllms/_build/.doctrees/notebooks/evals.doctree b/tamingllms/_build/.doctrees/notebooks/evals.doctree
diff --git a/tamingllms/_build/.doctrees/notebooks/input.doctree b/tamingllms/_build/.doctrees/notebooks/input.doctree
diff --git a/tamingllms/_build/.doctrees/notebooks/local.doctree b/tamingllms/_build/.doctrees/notebooks/local.doctree
diff --git a/tamingllms/_build/.doctrees/notebooks/safety.doctree b/tamingllms/_build/.doctrees/notebooks/safety.doctree
diff --git a/tamingllms/_build/.doctrees/notebooks/structured_output.doctree b/tamingllms/_build/.doctrees/notebooks/structured_output.doctree
diff --git a/tamingllms/_build/html/_images/2025.png b/tamingllms/_build/html/_images/2025.png
diff --git a/tamingllms/_build/html/_images/anth_contextual.png b/tamingllms/_build/html/_images/anth_contextual.png
diff --git a/tamingllms/data/input/asset_class.png → ...gllms/_build/html/_images/asset_class.png b/tamingllms/data/input/asset_class.png → ...gllms/_build/html/_images/asset_class.png
diff --git a/tamingllms/_build/html/_images/cic.png b/tamingllms/_build/html/_images/cic.png
diff --git a/tamingllms/_build/html/_images/deep.jpeg b/tamingllms/_build/html/_images/deep.jpeg
diff --git a/tamingllms/_build/html/_images/deep2.jpeg b/tamingllms/_build/html/_images/deep2.jpeg
diff --git a/tamingllms/_build/html/_images/diagram1.png b/tamingllms/_build/html/_images/diagram1.png
diff --git a/tamingllms/_build/html/_images/docling.png b/tamingllms/_build/html/_images/docling.png
diff --git a/tamingllms/_build/html/_images/forecast.png b/tamingllms/_build/html/_images/forecast.png
diff --git a/tamingllms/_build/html/_images/harvard.png b/tamingllms/_build/html/_images/harvard.png
diff --git a/tamingllms/_build/html/_images/markitdown.png b/tamingllms/_build/html/_images/markitdown.png
diff --git a/tamingllms/_build/html/_images/quiz.png b/tamingllms/_build/html/_images/quiz.png
diff --git a/tamingllms/_build/html/_sources/notebooks/input.ipynb b/tamingllms/_build/html/_sources/notebooks/input.ipynb
diff --git a/tamingllms/_build/html/_sources/notebooks/local.ipynb b/tamingllms/_build/html/_sources/notebooks/local.ipynb
@@ -181,11 +181,11 @@
     "Performance Comparison including proprietary models.\n",
     "```\n",
     "\n",
-    "Also from China, DeepSeek-V3 {cite}`deepseek2024v3` represents a major breakthrough in open source language models, emerging as arguably as the most capable open source large language model available today. With 671 billion parameters and 37 billion active MoE (Mixture of Experts) parameters, it achieves performance on par with leading proprietary models like Claude 3.5 Sonnet and GPT 4o as shown in {numref}`deep`. The model demonstrates impressive efficiency metrics (see {numref}`deep2`), processing input tokens at $0.27 per million and output tokens at $1.1 per million, while maintaining a generation speed of 60 tokens per second (3x faster than DeepSeek-V2).\n",
+    "Also from China, DeepSeek-V3 {cite}`deepseek2024v3` represents a major breakthrough in open source language models, emerging as arguably the most capable open source large language model available as of the end of 2024. With 671 billion parameters and 37 billion active MoE (Mixture of Experts) parameters, it achieves performance on par with leading proprietary models like Claude 3.5 Sonnet and GPT 4o as shown in {numref}`deep`. The model demonstrates impressive cost efficiency metrics (see {numref}`deep2`), processing input tokens at $0.27 per million and output tokens at $1.1 per million, while maintaining a generation speed of 60 tokens per second (3x faster than DeepSeek-V2).\n",
     "\n",
-    "What makes DeepSeek-V3 particularly remarkable is that these capabilities were achieved with a relatively modest training budget of just $5.5 million, used to train on 14.8 trillion tokens. This efficiency in training demonstrates the potential for open source models to compete with proprietary alternatives at a fraction of the cost. The model's release marks a significant milestone in the democratization of advanced AI capabilities, challenging the dominance of proprietary models.\n",
+    "What makes DeepSeek-V3 particularly remarkable is that these capabilities were achieved with a relatively modest training budget of just $5.5 million, used to train on 14.8 trillion tokens. This efficiency in training demonstrates the potential for open source models to compete with proprietary alternatives at a fraction of the cost. The model's release marks a significant milestone in the democratization of advanced AI capabilities, challenging the dominance of proprietary models within big tech. One should be cautious though as the model has not yet been battle-tested in the wild but this is an exciting development demonstrating the potential of open source models to compete with proprietary alternatives.\n",
     "\n",
-    "```{figure} ../_static/local/deep.png\n",
+    "```{figure} ../_static/local/deep.jpeg\n",
     "---\n",
     "name: deep\n",
     "alt: DeepSeek-V3\n",
@@ -195,7 +195,7 @@
     "DeepSeek-V3 Performance Comparison\n",
     "```\n",
     "\n",
-    "```{figure} ../_static/local/deep2.png\n",
+    "```{figure} ../_static/local/deep2.jpeg\n",
     "---\n",
     "name: deep2\n",
     "alt: DeepSeek-V3 Cost Benefit Analysis\n",

diff --git a/tamingllms/_build/html/_static/input/anth_contextual.png b/tamingllms/_build/html/_static/input/anth_contextual.png
diff --git a/tamingllms/_build/html/_static/input/asset_class.png b/tamingllms/_build/html/_static/input/asset_class.png
diff --git a/tamingllms/_build/html/_static/input/docling.png b/tamingllms/_build/html/_static/input/docling.png
diff --git a/tamingllms/_build/html/_static/input/markitdown.png b/tamingllms/_build/html/_static/input/markitdown.png
diff --git a/tamingllms/_build/html/genindex.html b/tamingllms/_build/html/genindex.html
@@ -160,6 +160,15 @@
 
 
 
+          </li>
+
+
+          <li class="toctree-l1 ">
+
+              <a href="notebooks/input.html" class="reference internal ">Managing Input Data</a>
+
+
+
           </li>
 
 

diff --git a/tamingllms/_build/html/markdown/intro.html b/tamingllms/_build/html/markdown/intro.html
@@ -178,6 +178,15 @@
 
 
 
+          </li>
+
+
+          <li class="toctree-l1 ">
+
+              <a href="../notebooks/input.html" class="reference internal ">Managing Input Data</a>
+
+
+
           </li>
 
 

diff --git a/tamingllms/_build/html/markdown/preface.html b/tamingllms/_build/html/markdown/preface.html
@@ -160,6 +160,15 @@
 
 
 
+          </li>
+
+
+          <li class="toctree-l1 ">
+
+              <a href="../notebooks/input.html" class="reference internal ">Managing Input Data</a>
+
+
+
           </li>
 
 
@@ -236,15 +245,15 @@ <h1><span class="section-number">1. </span>Preface<a class="headerlink" href="#p
 <div><p>Models tell you merely what something is like, not what something is.</p>
 <p class="attribution">—Emanuel Derman</p>
 </div></blockquote>
-<p>An alternative title of this book could have been “Language Models Behaving Badly”. If you are coming from a background in financial modeling, you may have noticed the parallel with Emanuel Derman’s seminal work “Models.Behaving.Badly” <span id="id1">[<a class="reference internal" href="#id169" title="E. Derman. Models.Behaving.Badly.: Why Confusing Illusion with Reality Can Lead to Disaster, on Wall Street and in Life. Free Press, 2011. ISBN 9781439165010. URL: https://books.google.co.uk/books?id=lke_cwM4wm8C.">Derman, 2011</a>]</span>. This parallel is not coincidental. Just as Derman cautioned against treating financial models as perfect representations of reality, this book aims to highlight the limitations and pitfalls of Large Language Models (LLMs) in practical applications.</p>
+<p>An alternative title of this book could have been “Language Models Behaving Badly”. If you are coming from a background in financial modeling, you may have noticed the parallel with Emanuel Derman’s seminal work “Models.Behaving.Badly” <span id="id1">[<a class="reference internal" href="#id176" title="E. Derman. Models.Behaving.Badly.: Why Confusing Illusion with Reality Can Lead to Disaster, on Wall Street and in Life. Free Press, 2011. ISBN 9781439165010. URL: https://books.google.co.uk/books?id=lke_cwM4wm8C.">Derman, 2011</a>]</span>. This parallel is not coincidental. Just as Derman cautioned against treating financial models as perfect representations of reality, this book aims to highlight the limitations and pitfalls of Large Language Models (LLMs) in practical applications.</p>
 <p>The book “Models.Behaving.Badly” by Emanuel Derman, a former physicist and Goldman Sachs quant, explores how financial and scientific models can fail when we mistake them for reality rather than treating them as approximations full of assumptions.
 The core premise of his work is that while models can be useful tools for understanding aspects of the world, they inherently involve simplification and assumptions. Derman argues that many financial crises, including the 2008 crash, occurred partly because people put too much faith in mathematical models without recognizing their limitations.</p>
 <p>Like financial models that failed to capture the complexity of human behavior and market dynamics, LLMs have inherent constraints. They can hallucinate facts, struggle with logical reasoning, and fail to maintain consistency across long outputs. Their responses, while often convincing, are probabilistic approximations based on training data rather than true understanding even though humans insist on treating them as “machines that can reason”.</p>
 <p>Today, there is this growing pervasive belief that these models could solve any problem, understand any context, or generate any content as wished by the user. Moreover, language models that were initially designed to be next-token prediction machines and chatbots are now been twisted and wrapped into “reasoning” machines for further integration into technology products and daily-life workflows that control, affect, or decide daily actions of our lives. This technological optimism coupled with lack of understanding of the models’ limitations may pose risks we are still trying to figure out.</p>
 <p>This book serves as an introductory, practical guide for practitioners and technology product builders - software engineers, data scientists, and product managers - who want to create the next generation of GenAI-based products with LLMs while remaining clear-eyed about their limitations and therefore their implications to end-users. Through detailed technical analysis, reproducible Python code examples we explore the gap between LLM capabilities and reliable software product development.</p>
 <p>The goal is not to diminish the transformative potential of LLMs, but rather to promote a more nuanced understanding of their behavior. By acknowledging and working within their constraints, developers can create more reliable and trustworthy applications. After all, as Derman taught us, the first step to using a model effectively is understanding where it breaks down.</p>
 <div class="docutils container" id="id2">
-<div class="citation" id="id169" role="doc-biblioentry">
+<div class="citation" id="id176" role="doc-biblioentry">
 <span class="label"><span class="fn-bracket">[</span><a role="doc-backlink" href="#id1">Der11</a><span class="fn-bracket">]</span></span>
 <p>E. Derman. <em>Models.Behaving.Badly.: Why Confusing Illusion with Reality Can Lead to Disaster, on Wall Street and in Life</em>. Free Press, 2011. ISBN 9781439165010. URL: <a class="reference external" href="https://books.google.co.uk/books?id=lke_cwM4wm8C">https://books.google.co.uk/books?id=lke_cwM4wm8C</a>.</p>
 </div>

diff --git a/tamingllms/_build/html/markdown/toc.html b/tamingllms/_build/html/markdown/toc.html
@@ -153,6 +153,15 @@
 
 
 
+          </li>
+
+
+          <li class="toctree-l1 ">
+
+              <a href="../notebooks/input.html" class="reference internal ">Managing Input Data</a>
+
+
+
           </li>