<?xml version="1.0" encoding="UTF-8"?><rss xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:atom="http://www.w3.org/2005/Atom" version="2.0" xmlns:itunes="http://www.itunes.com/dtds/podcast-1.0.dtd" xmlns:googleplay="http://www.google.com/schemas/play-podcasts/1.0"><channel><title><![CDATA[GoodReason AI Blog]]></title><description><![CDATA[Blog of GoodReason AI.]]></description><link>https://blog.goodreason.ai</link><image><url>https://substackcdn.com/image/fetch/$s_!zvqI!,w_256,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc245cfcd-5180-4b31-996d-0bd68431f350_1280x1280.png</url><title>GoodReason AI Blog</title><link>https://blog.goodreason.ai</link></image><generator>Substack</generator><lastBuildDate>Fri, 01 May 2026 01:37:10 GMT</lastBuildDate><atom:link href="https://blog.goodreason.ai/feed" rel="self" type="application/rss+xml"/><copyright><![CDATA[U.S. Artificial Intelligence Inc.]]></copyright><language><![CDATA[en]]></language><webMaster><![CDATA[goodreason@substack.com]]></webMaster><itunes:owner><itunes:email><![CDATA[goodreason@substack.com]]></itunes:email><itunes:name><![CDATA[Gordon Kamer]]></itunes:name></itunes:owner><itunes:author><![CDATA[Gordon Kamer]]></itunes:author><googleplay:owner><![CDATA[goodreason@substack.com]]></googleplay:owner><googleplay:email><![CDATA[goodreason@substack.com]]></googleplay:email><googleplay:author><![CDATA[Gordon Kamer]]></googleplay:author><itunes:block><![CDATA[Yes]]></itunes:block><item><title><![CDATA[A.I. Tells Me How to Vote]]></title><description><![CDATA[How to become an informed voter with Abbey]]></description><link>https://blog.goodreason.ai/p/ai-tells-me-how-to-vote</link><guid isPermaLink="false">https://blog.goodreason.ai/p/ai-tells-me-how-to-vote</guid><dc:creator><![CDATA[Gordon Kamer]]></dc:creator><pubDate>Mon, 04 Nov 2024 17:00:11 GMT</pubDate><enclosure url="https://substackcdn.com/image/fetch/$s_!IBRl!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1cda00b0-7465-4d45-a352-3dd72fd640cf_1944x1094.png" length="0" type="image/jpeg"/><content:encoded><![CDATA[<p>Misinformation, dark money internet trolls, and deep-fakes: these are the issues that try men&#8217;s souls regarding AI and elections. But there&#8217;s another aspect that politicians don&#8217;t talk about as much &#8211; though they should. If you ask AI to help you research and recommend candidates, what does it say? Is it biased, a helpful tool, both? People often use AI as a replacement for Google. So if search results are biased, how are AI chats?</p><p>Anyone who lives in California knows that a ballot resembles a textbook &#8211; more of a <em>Das Kapital</em> than a lightweight <em>Communist Manifesto</em> &#8211; oversized envelopes carried by burdened postmen in their righteous democratic quest. </p><p>I wanted to know what AI thinks of the litany of federal, state, and local elections &#8211; plus propositions 2, 3, 4, 5, 6, 32, 33, 34, 35, 36, G, and AA? (the last two being specific to Mountain View, where I live).</p><p><strong>Key Points From This Post</strong></p><ul><li><p>It&#8217;s easy to <strong>create tables comparing candidates&#8217; positions automatically</strong>, even in lesser known elections.</p></li><li><p>AI can be strongly influenced by its source material, like newspaper endorsements that people might not otherwise read.</p></li><li><p>It can help <strong>fact check issues</strong> or double check figures, like getting Adam Schiff&#8217;s actual campaign rhetoric on price controls, or CA&#8217;s state budget &#8211; <strong>all with proper citations</strong>, <strong>which themselves can be brought in easily to chat with</strong>.</p></li><li><p>AI can help <strong>quickly identify the rough political valence of the candidate</strong>, including intra-party distinctions (moderate vs. radical) much more easily, I find, than using Wikipedia or relying on campaign ads.</p></li><li><p>It can be helpful to get <strong>more political context</strong> on issues. For Prop 34, it let me know that it is primarily targeting one organization, the AIDS Foundation, which is using its money to support rent control and purchase real estate (widely reported, but perhaps not widely known to voters).</p></li></ul><p>You can find all of my notes in an <a href="https://abbey.us.ai/assets/34546">Abbey Workspace</a>. An outline tab on the right side was automatically generated by Abbey to organize everything by office. Nearly every note in the workspace was saved with one click from a chat which used Abbey&#8217;s &#8220;Use Web&#8221; feature with Claude Sonnet 3.5 selected. See the Workspace on Abbey to view the proper citations.</p><div><hr></div><h1><strong>U.S. Senate</strong></h1><p>First, let&#8217;s get biographies for each candidate:</p><blockquote><p><em>Steve Garvey:</em></p><p><em>Steve Garvey is a former Major League Baseball player running as a Republican candidate for U.S. Senate in California. Born in Tampa, Florida, Garvey earned a B.S. from Michigan State University. He played first base for the Los Angeles Dodgers and San Diego Padres from 1969 to 1987, becoming a 10-time National League All-Star. After his baseball career, Garvey founded Garvey Marketing Group and worked with various charitable causes. This is Garvey's first time running for political office.</em></p><p><em>Adam Schiff:</em></p><p><em>Adam Schiff is a Democratic congressman who has represented communities in Los Angeles and the San Gabriel Valley for nearly three decades. Born in 1960, Schiff worked as an assistant U.S. attorney before being elected to the California State Senate in 19963. He was elected to the U.S. House of Representatives in 2001 and is currently serving his 12th term. Schiff gained national prominence as chair of the House Intelligence Committee from 2019 to 2023 and for his role in leading the first impeachment of President Donald Trump. He also served on the January 6th committee investigating the Capitol attack.</em></p></blockquote><p>Now, let&#8217;s dig into the issues with a table comparing their stances:</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!IBRl!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1cda00b0-7465-4d45-a352-3dd72fd640cf_1944x1094.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!IBRl!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1cda00b0-7465-4d45-a352-3dd72fd640cf_1944x1094.png 424w, https://substackcdn.com/image/fetch/$s_!IBRl!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1cda00b0-7465-4d45-a352-3dd72fd640cf_1944x1094.png 848w, https://substackcdn.com/image/fetch/$s_!IBRl!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1cda00b0-7465-4d45-a352-3dd72fd640cf_1944x1094.png 1272w, https://substackcdn.com/image/fetch/$s_!IBRl!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1cda00b0-7465-4d45-a352-3dd72fd640cf_1944x1094.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!IBRl!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1cda00b0-7465-4d45-a352-3dd72fd640cf_1944x1094.png" width="1456" height="819" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/1cda00b0-7465-4d45-a352-3dd72fd640cf_1944x1094.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:819,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:433015,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!IBRl!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1cda00b0-7465-4d45-a352-3dd72fd640cf_1944x1094.png 424w, https://substackcdn.com/image/fetch/$s_!IBRl!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1cda00b0-7465-4d45-a352-3dd72fd640cf_1944x1094.png 848w, https://substackcdn.com/image/fetch/$s_!IBRl!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1cda00b0-7465-4d45-a352-3dd72fd640cf_1944x1094.png 1272w, https://substackcdn.com/image/fetch/$s_!IBRl!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1cda00b0-7465-4d45-a352-3dd72fd640cf_1944x1094.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>After reading this, I was interested in Schiff&#8217;s stance on &#8220;price gouging rules&#8221; &#8211; is that really important to him? I saw that the source was a KTVU article, so I opened a chat with just that article, and asked for more. Something interested me there:</p><blockquote><ol start="2"><li><p>Grocery Industry Focus: Schiff highlights the concentration of power in the grocery industry as an example, noting that "more than 60% of America's grocery sales are now concentrated within just four companies, creating a virtual monopoly that is causing higher prices for shoppers and fewer grocery store job opportunities"</p></li></ol></blockquote><p>I asked, &#8220;What is the profit margin of a typical grocery store?&#8221; <a href="https://www.grocerydive.com/news/grocery-industry-profit-margins-fall-to-pre-pandemic-levels-fmi/720517/">Answer: 2.2%.</a></p><h1><strong>U.S. House</strong></h1><p></p><p>This is a highly contested election between Evan Low and Sam Liccardo (both Democrats) to replace a long-serving Representative.</p><blockquote><p>Here are brief biographies of Evan Low and Sam Liccardo, who are both candidates for U.S. Representative in California's 16th congressional district:</p><p>Evan Low:</p><p>Evan Low is a Democratic politician currently serving in the California State Assembly, representing the 26th Assembly District.</p><p>Some key points about Low's background and career:</p><ul><li><p>Born in 1983 in San Jose, California</p></li><li><p>Elected to the Campbell City Council in 2006 at age 23</p></li><li><p>Became mayor of Campbell in 2009, making him the youngest openly gay Asian American mayor in the nation at age 26</p></li><li><p>Elected to the California State Assembly in 2014</p></li><li><p>Has served as Chair of the California Asian American &amp; Pacific Islander Legislative Caucus</p></li><li><p>Co-founder and co-chair of the California Legislative Tech Caucus</p></li><li><p>Focused on issues like technology policy, LGBTQ+ rights, and affordable housing in his legislative career</p></li></ul><p>Low is running for Congress on priorities including protecting abortion rights, lowering the cost of living, addressing homelessness, and promoting technology and innovation.</p><p>Sam Liccardo:</p><p>Sam Liccardo is a Democratic politician who most recently served as the mayor of San Jose, California from 2015 to 2022. Key points about Liccardo's background:</p><ul><li><p>Born in 1970 in Saratoga, California</p></li><li><p>Graduated from Georgetown University and Harvard Law School</p></li><li><p>Worked as a federal prosecutor and criminal prosecutor for the Santa Clara County District Attorney's Office</p></li><li><p>Elected to the San Jose City Council in 2006</p></li><li><p>Elected mayor of San Jose in 2014 and re-elected in 2018</p></li><li><p>As mayor, focused on issues like pension reform, economic development, homelessness, and environmental initiatives</p></li></ul><p>In his congressional campaign, Liccardo has emphasized his record as mayor and his plans to address issues like the cost of living, public safety, and technology policy.</p><p>Both candidates are vying to replace retiring U.S. Representative Anna Eshoo in California's 16th congressional district.</p></blockquote><p></p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!c7x7!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F46d72e3a-871d-4630-9f8e-8b92994dfde7_1932x876.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!c7x7!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F46d72e3a-871d-4630-9f8e-8b92994dfde7_1932x876.png 424w, https://substackcdn.com/image/fetch/$s_!c7x7!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F46d72e3a-871d-4630-9f8e-8b92994dfde7_1932x876.png 848w, https://substackcdn.com/image/fetch/$s_!c7x7!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F46d72e3a-871d-4630-9f8e-8b92994dfde7_1932x876.png 1272w, https://substackcdn.com/image/fetch/$s_!c7x7!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F46d72e3a-871d-4630-9f8e-8b92994dfde7_1932x876.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!c7x7!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F46d72e3a-871d-4630-9f8e-8b92994dfde7_1932x876.png" width="1456" height="660" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/46d72e3a-871d-4630-9f8e-8b92994dfde7_1932x876.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:660,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:304698,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!c7x7!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F46d72e3a-871d-4630-9f8e-8b92994dfde7_1932x876.png 424w, https://substackcdn.com/image/fetch/$s_!c7x7!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F46d72e3a-871d-4630-9f8e-8b92994dfde7_1932x876.png 848w, https://substackcdn.com/image/fetch/$s_!c7x7!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F46d72e3a-871d-4630-9f8e-8b92994dfde7_1932x876.png 1272w, https://substackcdn.com/image/fetch/$s_!c7x7!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F46d72e3a-871d-4630-9f8e-8b92994dfde7_1932x876.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>As you can see, it&#8217;s very easy to pinpoint Sam Liccardo as the moderate from this table.</p><p></p><h1>Other Elections and Propositions</h1><p></p><p>Here is Abbey&#8217;s concluding comment on a judicial race:</p><blockquote><p>It's worth noting that while both candidates have significant experience in the District Attorney's Office, Boyarsky has taken on more administrative roles in recent years, while Stebbins has maintained a focus on prosecutorial work. The San Jose Mercury News editorial board favors Boyarsky, citing his management experience and broad support.</p></blockquote><p>This paragraph mentions the Mercury News&#8217; endorsement of Boyarsky&#8217;s for the second time in its response, giving Boyarsky the last word in its summary. The model might over-index on media endorsements (which get put into the model&#8217;s &#8220;context window&#8221; and might cause significant influence on language). </p><p>Another thing I&#8217;ll note &#8211; in a race involving &#8220;engineer&#8221; Alexander Glew, I asked what kind of an engineer he was. Apparently, he has a PhD in Materials Science from Stanford and wrote his thesis on chemical vapor deposition &#8211; a bit more expertise than necessary to be called just an &#8220;engineer&#8221; in the valley, as he is described in the media.</p><p>You can find more on my Abbey Workspace <a href="https://abbey.us.ai/assets/34546">here</a>. </p><p></p><h1>Conclusion</h1><p></p><p>The end, I gave a voter description and asked whom that voter would support in each race. I wrote, &#8220;I&#8217;m a moderate conservative who favors market based approaches to policy. Based on that, who would I want to support in this election, for each of the offices?&#8221;</p><blockquote><p>Based on your self-description as a moderate conservative who favors market-based approaches, here are some suggestions for candidates you might consider supporting in this election:</p><ol><li><p>U.S. Senate: Steve Garvey (R). Garvey's positions on controlling federal spending, supporting supply-side economics, and emphasizing law enforcement align more closely with moderate conservative views.</p></li><li><p>U.S. House (16th District): Sam Liccardo. While both candidates are Democrats, Liccardo tends to present a more moderate stance on some issues compared to Low.</p></li><li><p>State Senate (District 13): Alexander Glew (R). Glew's background as an engineer and business owner, along with his priorities like resolving unfunded pension liability and reducing business regulation, may appeal to a moderate conservative.</p></li><li><p>State Assembly (District 23): Lydia Kou. Although both candidates are Democrats, Kou's opposition to recent state housing bills and emphasis on local control aligns more with conservative views on government intervention.</p></li><li><p>Santa Clara County Board of Supervisors (District 5): Margaret Abe-Koga. While both candidates are Democrats, Abe-Koga has received support from business-oriented groups like the California Apartment Association, which might indicate more market-friendly policies.</p></li><li><p>Mountain View City Council: John McAlister. As a former council member, local business owner, and certified public accountant, McAlister's focus on solving issues rather than making policy and his aim to revitalize local businesses aligns with moderate conservative views.</p></li><li><p>El Camino Healthcare District: John Zoglin. With his background in the high-tech industry and healthcare analytics, and focus on delivering strong financial results, Zoglin might appeal to a market-oriented conservative.</p></li></ol><p>Remember, these suggestions are based on limited information and your stated preferences. It's always important to research candidates thoroughly and consider their full platforms before making your decision.</p></blockquote><p></p><p>I would say&#8211; not bad, but I&#8217;m not sure a market-oriented conservative would support anti-housing Lydia Kou. Perhaps it didn&#8217;t put enough valence on &#8220;market based approaches to policy,&#8221; or didn&#8217;t quite know what the state bills were. On the other hand, it&#8217;s possible that a conservative instinct toward local control might override an impulse to deregulate. </p><p>On propositions: you can find notes I made in the <a href="https://abbey.us.ai/assets/34546">Abbey Workspace</a>.</p><p><strong>And as for the presidential race, if you need AI to help you at this point (either way): good luck.</strong></p>]]></content:encoded></item><item><title><![CDATA[Defusing the Population Bomb with Abbey]]></title><description><![CDATA[How to debunk a popular myth using Abbey]]></description><link>https://blog.goodreason.ai/p/defusing-the-population-bomb-with</link><guid isPermaLink="false">https://blog.goodreason.ai/p/defusing-the-population-bomb-with</guid><dc:creator><![CDATA[Jack Montinaro]]></dc:creator><pubDate>Thu, 19 Sep 2024 16:00:28 GMT</pubDate><enclosure url="https://substackcdn.com/image/fetch/$s_!vZqV!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb7c04702-cdd3-44bd-a169-6450dcf34aa0_976x976.webp" length="0" type="image/jpeg"/><content:encoded><![CDATA[<div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!vZqV!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb7c04702-cdd3-44bd-a169-6450dcf34aa0_976x976.webp" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!vZqV!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb7c04702-cdd3-44bd-a169-6450dcf34aa0_976x976.webp 424w, https://substackcdn.com/image/fetch/$s_!vZqV!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb7c04702-cdd3-44bd-a169-6450dcf34aa0_976x976.webp 848w, https://substackcdn.com/image/fetch/$s_!vZqV!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb7c04702-cdd3-44bd-a169-6450dcf34aa0_976x976.webp 1272w, https://substackcdn.com/image/fetch/$s_!vZqV!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb7c04702-cdd3-44bd-a169-6450dcf34aa0_976x976.webp 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!vZqV!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb7c04702-cdd3-44bd-a169-6450dcf34aa0_976x976.webp" width="520" height="520" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/b7c04702-cdd3-44bd-a169-6450dcf34aa0_976x976.webp&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:976,&quot;width&quot;:976,&quot;resizeWidth&quot;:520,&quot;bytes&quot;:null,&quot;alt&quot;:&quot;Image&quot;,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:true,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="Image" title="Image" srcset="https://substackcdn.com/image/fetch/$s_!vZqV!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb7c04702-cdd3-44bd-a169-6450dcf34aa0_976x976.webp 424w, https://substackcdn.com/image/fetch/$s_!vZqV!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb7c04702-cdd3-44bd-a169-6450dcf34aa0_976x976.webp 848w, https://substackcdn.com/image/fetch/$s_!vZqV!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb7c04702-cdd3-44bd-a169-6450dcf34aa0_976x976.webp 1272w, https://substackcdn.com/image/fetch/$s_!vZqV!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb7c04702-cdd3-44bd-a169-6450dcf34aa0_976x976.webp 1456w" sizes="100vw" fetchpriority="high"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>In 1700, the world&#8217;s population <a href="https://abbey.us.ai/assets/27270">was around 600 million</a>. By the 1800s, it was 1 billion, and by 1925, it was 2 billion. By 1968, an astonishing 3.5 billion people lived on a planet that a short time before had not been able to sustain even a fraction of that amount. So it was not radical to believe that mass starvation and scarcity was just around the corner: Earth was full. This belief was the thesis of the popular and influential book <em>The Population Bomb</em>, written by biologist Paul Ehrlich in 1968.</p><p>However, Ehrlich would be shocked to learn that in the years since his book was published, rates of poverty and starvation have actually <a href="https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3777609/">declined</a> &#8211; his &#8220;population bomb&#8221; fizzled into disrepute.</p><p>So how can we debunk Ehrlich&#8217;s work using Abbey? We&#8217;ll find material using web search in Chat, and keep notes and summarize sources in Workspace. For any website we find, we&#8217;ll use Abbey&#8217;s <a href="https://chromewebstore.google.com/detail/abbey/lajhghkelnmbdapgcgmdbfolbicigeac?hl=en-US&amp;pli=1">Chrome extension</a> to bring it into the Workspace to chat with.</p><p>By the end, we should have a strong understanding of what exactly Ehrlich got wrong and how you can replicate the research on Abbey. You can find the Abbey Workspace for this article <a href="https://abbey.us.ai/assets/26645">here</a><strong>.</strong></p><p></p><p><strong>The Book</strong></p><p>Before publishing <em>The Population Bomb</em>, Paul Ehrlich was already recognized for his research on butterflies and evolutionary biology. His observation of population dynamics in various species, alongside his growing concern about human overpopulation, propelled him into the limelight. Having witnessed firsthand the effects of dense populations on ecosystems and resource competition, Ehrlich was convinced that similar principles applied to humans. This belief crystallized during a sabbatical in India in the mid-1960s, where Ehrlich was overwhelmed by the staggering number of people living in dire conditions. This experience became a driving force behind the writing of The Population Bomb. Upon returning to the United States, Ehrlich resolved to focus his academic and public efforts on raising awareness about the impending demographic crisis. His work aimed to integrate rigorous scientific understanding with accessible public discourse to mobilize action on population control.</p><p>With Abbey we can create a one-click summary of the text, given here:</p><blockquote><p><em>The Population Bomb by Paul R. Ehrlich discusses the urgent problem of overpopulation and its consequences. Ehrlich emphasizes that overpopulation is the most critical threat and affects personal, national, and international planning. He suggests that rational planning can only begin once population issues are addressed. Ehrlich provides his credentials as an authority in population biology, mentioning his position at Stanford University and his extensive publication record.</em></p><p><em>The book outlines various critical themes including:</em></p><ul><li><p><em>Overpopulation leading to insufficient food production, a dying planet, and escalating socio-environmental issues.</em></p></li><li><p><em>Scenarios of possible future crises such as famine, pandemics, and war that could result from unchecked population growth.</em></p></li><li><p><em>Failures of family planning programs, particularly in UDCs (Underdeveloped Countries), where birth control measures are often inefficient and poorly implemented.</em></p></li><li><p><em>Necessary actions like global family planning, reconsideration of socio-political structures, and reinforcement of environmental laws to stabilize population growth.</em></p></li></ul><p><em>Ehrlich advocates for immediate and sustained global efforts towards population control and environmental preservation to prevent catastrophic outcomes. He also discusses the role of technology, international aid, and shifts in societal behaviors necessary for these changes. The ten-year update included in later editions reflects on changes seen since the book's initial publication, highlighting some progress in developed countries but recognizing ongoing challenges globally. The recommended reading section suggests further resources for understanding and addressing population and environmental issues.</em></p></blockquote><p></p><p><strong>The Green Revolution</strong></p><p>Ehrlich states that food production cannot keep up, citing stunted growth in the early 70s. When asking Abbey about evidence Ehrlich cited, it pointed me to this page with the excerpt,&nbsp;</p><blockquote><p>&#8220;When the Soil Bank was abolished in 1975, planted acreage was increased by 12% over the early 1970s. But 1975 production &#8211; a "bumper" year &#8211; was less than 2% higher than that of 1973, the previous record year, and average yield -(production per acre) was 2% lower than in 1973.&#8221;</p><p>Pg. 210, 225 on PDF.</p></blockquote><p>He believed the recent advancements in the &#8220;Green Revolution&#8221; could not keep up. The emergence of high yielding crop varieties is dwarfed by the depletion of precious groundwater reserves and soil erosion. Furthermore, the heavy reliance on chemical fertilizers and pesticides had led to increased pollution and a significant loss of biodiversity. Ehrlich argues that these modern agricultural practices had introduced a new set of environmental and social challenges, including greater vulnerability to pests and diseases due to monocropping. The sustainability of such practices is highly questionable, as they often result in long-term ecological damage while offering short-term yield gains. To address these issues, Ehrlich advocated for a comprehensive approach that integrates sustainable agricultural techniques, improved water management, and a focus on preserving natural ecosystems. Without these measures, he warns, the progress made in food production during the Green Revolution could unravel, leaving future generations to face even more severe food security crises.&nbsp;</p><p>Ehrlich, however, underestimated human ingenuity. With Abbey, we can search the web and bring in examples of innovations into our workspace. High yield varieties transformed wheat production, IR8 or &#8220;miracle rice&#8221; revolutionized rice farming in Asia, drip irrigation being developed in Israel, all part of the green revolution becoming more widespread around the time his work is published. Innovative agricultural advancements like these showcased humanity&#8217;s resilience and ingenuity. Rather than succumbing to famine, societies embraced scientific innovations to enhance food security and improve livelihoods. The Green Revolution's profound impact was exemplified by the dramatic increases in crop production and reductions in malnutrition across various regions, and these side effects were greatly overestimated relative to the yield increase.</p><p></p><p><strong>Population Control Suggestions</strong></p><p>Dr. Ehrlich&#8217;s suggestions ranged from voluntary methods to more controversial and compulsory approaches. On the voluntary side, Ehrlich advocated for financial incentives to discourage childbearing, such as offering "responsibility prizes" for childless marriages and implementing luxury taxes on child-related items like diapers and cribs. He also suggested a direct tax on children to further discourage large families. Ehrlich recognized the power of media and public awareness, making appearances on popular shows like "The Tonight Show" to spread his message about the dangers of overpopulation. These efforts helped grow his organization, Zero Population Growth, and brought the issue to the forefront of public consciousness.&nbsp;</p><p>His promotion of birth control gained him support from women's health advocates, aligning population control with reproductive rights. However, Ehrlich's proposals also ventured into more contentious territory. He suggested creating blacklists of individuals, companies, and organizations that he believed were impeding population control efforts in the United States. Perhaps most controversially, there are reports of Ehrlich speculating about adding contraceptives to the public water supply to reduce fertility rates on a broad scale. While many of his more extreme suggestions were never implemented, Ehrlich's alarmist approach created a sense of urgency and even panic about population growth.</p><p></p><p><strong>Demographic Transitions</strong></p><p>What Ehrlich failed to foresee was how the birth rate changes as countries develop. As nations progress economically and socially, they often experience a demographic transition that includes a rapid decline in birth rates. Initially, both birth and death rates are high, keeping population growth relatively stable. However, as public health, education, and economic conditions improve, death rates begin to fall. This stage is quickly followed by a substantial decline in birth rates as people adapt to new economic realities where smaller families become more advantageous. Increased access to education, particularly for women, greater employment opportunities, and improved healthcare contribute significantly to decreasing fertility rates. This demographic transition has already been observed in many developing countries, resulting in slower population growth and even population decline in some areas.&nbsp;</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!mmPm!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0fdf2299-f1a2-4f17-8522-991526912ffe_1600x1428.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!mmPm!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0fdf2299-f1a2-4f17-8522-991526912ffe_1600x1428.png 424w, https://substackcdn.com/image/fetch/$s_!mmPm!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0fdf2299-f1a2-4f17-8522-991526912ffe_1600x1428.png 848w, https://substackcdn.com/image/fetch/$s_!mmPm!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0fdf2299-f1a2-4f17-8522-991526912ffe_1600x1428.png 1272w, https://substackcdn.com/image/fetch/$s_!mmPm!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0fdf2299-f1a2-4f17-8522-991526912ffe_1600x1428.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!mmPm!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0fdf2299-f1a2-4f17-8522-991526912ffe_1600x1428.png" width="1456" height="1299" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/0fdf2299-f1a2-4f17-8522-991526912ffe_1600x1428.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:1299,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!mmPm!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0fdf2299-f1a2-4f17-8522-991526912ffe_1600x1428.png 424w, https://substackcdn.com/image/fetch/$s_!mmPm!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0fdf2299-f1a2-4f17-8522-991526912ffe_1600x1428.png 848w, https://substackcdn.com/image/fetch/$s_!mmPm!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0fdf2299-f1a2-4f17-8522-991526912ffe_1600x1428.png 1272w, https://substackcdn.com/image/fetch/$s_!mmPm!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0fdf2299-f1a2-4f17-8522-991526912ffe_1600x1428.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p><em>Courtesy: Our World In Data</em></p><p></p><p>While we see the overall growth rate fall, this aggregate number is made up of differing stories from different countries. Looking at the chart, and using image chat with Abbey, we see developed nations have been actually struggling with stunted growth rates.</p><p></p><p><strong>Challenges of Aging Populations</strong></p><p>Societies with aging populations face a range of interconnected challenges, particularly in the economic and healthcare sectors. Economically, these societies often struggle with a shrinking workforce, which can lead to reduced productivity and slower economic growth. This demographic shift puts increased pressure on public finances, as pension and healthcare costs rise while the tax base narrows. There's also concern that an aging population might result in a decline in innovation and entrepreneurship, traditionally associated with younger demographics.</p><p>The healthcare system in aging societies faces significant strain as well. There's a greater demand for healthcare services and long-term care, which can lead to rising costs for medical treatments and eldercare. This increased demand puts pressure on healthcare infrastructure and resources, potentially affecting the quality and accessibility of care for all age groups.</p><p>The sustainability of social security systems becomes a pressing issue in aging societies. With fewer working-age individuals supporting a larger retired population, there's often a need to reevaluate and potentially restructure these systems. This might involve reducing benefits, increasing the retirement age, or finding other solutions to ensure the system's long-term viability. Policymakers face the challenge of balancing the needs of the aging population with the economic realities of a changing demographic landscape.</p><p></p><p><strong>Conclusion</strong></p><p>This exercise served as one of the ways you can use Abbey Workspace to learn about something new in Abbey. I hope you enjoyed following along by revisiting one of the more controversial works from the late 20th century.</p>]]></content:encoded></item><item><title><![CDATA[When Soviet Scientists Won the Cold War: Understanding Stealth with Abbey]]></title><description><![CDATA[A brief history and introduction to Pyotr Ufimstev's foundational work on stealth technology, using Abbey.]]></description><link>https://blog.goodreason.ai/p/when-soviet-scientists-won-the-cold</link><guid isPermaLink="false">https://blog.goodreason.ai/p/when-soviet-scientists-won-the-cold</guid><dc:creator><![CDATA[Gordon Kamer]]></dc:creator><pubDate>Tue, 26 Mar 2024 17:36:53 GMT</pubDate><enclosure url="https://substackcdn.com/image/fetch/$s_!6NLj!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F775fa06e-3547-47a7-8a5c-4ee8d9f6e42c_1600x1066.png" length="0" type="image/jpeg"/><content:encoded><![CDATA[<p>In the early 1970s, Lockheed Martin engineer Denys Overholser discovered the key to stealth technology hidden in a stack of translated Soviet technical papers. Disregarded by the Soviet academic elite, and unheard of in the United States, Pyotr Ufimstev had worked out calculations that would help win the Cold War &#8211; but for the other side.</p><p>I&#8217;d like to provide some historical background and then work through a small part of Ufimtsev&#8217;s paper, using <a href="https://abbey.us.ai">Abbey</a>. You can find a course made to take you through some of the paper <a href="https://abbey.us.ai/assets/11311">here</a>, which provides general scientific background (textbook chapters), notes, quizzes, shared chats, and chat-able documents.</p><p><strong>Historical Background</strong></p><p>Soviet censors didn&#8217;t believe that Pyotr Ufimstev&#8217;s obscure book <em>Method of Edge Waves in the Physical Theory of Diffraction</em> had much use, so they permitted its publication and translation into English. But when the book arrived at Lockheed Martin&#8217;s <a href="https://abbey.us.ai/assets/11606">Skunk Works</a> division &#8211; the famed team behind the SR-71 Blackbird and the U-2 spy plane &#8211; a bright engineer quickly <a href="https://www.sandboxx.us/news/the-air-force-made-lockheeds-skunk-works-design-a-stealth-pole/">understood</a> that it was exactly what he needed.</p><p>The U.S. had a problem: American-designed planes used in the Yom Kippur War <a href="https://abbey.us.ai/assets/11288">proved shockingly vulnerable</a> to Soviet radar, resulting in a devastating loss of aircraft. Military aircraft manufacturers like Lockheed Martin and Northrop Grumman tried their best to reduce their planes&#8217; radar cross section &#8211; the measurement of how large an object would appear on radar &#8211; for years, but even the stealthiest planes, like the U-2 spy plane, still lit up Russian radar.</p><p>Ufimstev&#8217;s paper provided a superior method for calculating radar cross sections, which had previously been imprecise. An important implication of his work, which was new to many, was that an aircraft&#8217;s shape &#8211; not its size or material composition &#8211; was the most important factor in determining how large it would appear to radar systems. American designers could calculate the radar cross section of a plane&#8217;s design on computers and optimize for stealthiness.</p><p>The first plane designed using this method was the F-117 Nighthawk. Its radar cross section was orders of magnitude below anything ever designed before. In fact, at the military&#8217;s radar proving ground, where Lockheed was competing for a project, Lockheed complained that the pole on which their plane model was sitting had a larger radar cross section than the plane itself. So Lockheed developed a &#8220;stealth pole&#8221; to mount the stealth plane. When Northrop&#8217;s program manager saw the results of the tests on the new pole, he <a href="https://www.sandboxx.us/news/the-air-force-made-lockheeds-skunk-works-design-a-stealth-pole/">said</a>, &#8220;If they can do that with a frigging pole, what can they do with their damned model?&#8221;</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!6NLj!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F775fa06e-3547-47a7-8a5c-4ee8d9f6e42c_1600x1066.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!6NLj!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F775fa06e-3547-47a7-8a5c-4ee8d9f6e42c_1600x1066.png 424w, https://substackcdn.com/image/fetch/$s_!6NLj!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F775fa06e-3547-47a7-8a5c-4ee8d9f6e42c_1600x1066.png 848w, https://substackcdn.com/image/fetch/$s_!6NLj!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F775fa06e-3547-47a7-8a5c-4ee8d9f6e42c_1600x1066.png 1272w, https://substackcdn.com/image/fetch/$s_!6NLj!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F775fa06e-3547-47a7-8a5c-4ee8d9f6e42c_1600x1066.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!6NLj!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F775fa06e-3547-47a7-8a5c-4ee8d9f6e42c_1600x1066.png" width="710" height="473.00824175824175" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/775fa06e-3547-47a7-8a5c-4ee8d9f6e42c_1600x1066.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:970,&quot;width&quot;:1456,&quot;resizeWidth&quot;:710,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:true,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!6NLj!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F775fa06e-3547-47a7-8a5c-4ee8d9f6e42c_1600x1066.png 424w, https://substackcdn.com/image/fetch/$s_!6NLj!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F775fa06e-3547-47a7-8a5c-4ee8d9f6e42c_1600x1066.png 848w, https://substackcdn.com/image/fetch/$s_!6NLj!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F775fa06e-3547-47a7-8a5c-4ee8d9f6e42c_1600x1066.png 1272w, https://substackcdn.com/image/fetch/$s_!6NLj!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F775fa06e-3547-47a7-8a5c-4ee8d9f6e42c_1600x1066.png 1456w" sizes="100vw" fetchpriority="high"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>The plane would be a closely guarded secret for years, first entering major combat operations during Desert Storm. It would <a href="https://abbey.us.ai/assets/12407">fly</a> 1,300 sorties in Iraq without being downed once.</p><p>In this story, the great American advance came not from a new brilliant inspiration or new experiments, but from a close reading of the literature. I hope that Abbey can help more people read and research as effectively as the engineers at the Skunk Works.</p><p><em>Note: The above story is from Skunk Works: A Personal Memoir of My Years at Lockheed by Ben Rich and Leo Janos (first published in 1994). Other works tell the history of stealth differently. They may emphasize, for example, U.S. supercomputing superiority, which allowed the calculations to be performed; the improved onboard computer systems required to fly a plane whose shape was not optimized for flight; and other papers published in the United States around the early 1970s with implications for stealth (see a Northrop paper <a href="https://ece-research.unm.edu/summa/notes/In/0101.pdf">here</a>, published in 1972, which cites other American sources).</em></p><p>&#8212;</p><p><strong>The Book: </strong><em><strong>Method of Edge Waves in the Physical Theory of Diffraction</strong></em></p><p>Dennys Overholser <a href="https://www.statesmanjournal.com/story/news/2016/04/16/secret-weapon-stealth-tech-dallas/82678480/">called</a> the paper &#8220;so obtuse and impenetrable that only a nerd's nerd would have waded through it all.&#8221; I&#8217;ll try and wade through some and leave the rest as an exercise to the reader.</p><p>Even if you don&#8217;t have much of a background in this subject, you might be surprised at how much you can learn by going through the <a href="https://abbey.us.ai/assets/11311">course</a> on Abbey if you&#8217;re willing to put in some effort. For comments or corrections, please reach out to <a href="mailto:gordon@us.ai">gordon@us.ai</a>.</p><p><strong>Technical Preliminaries</strong></p><p>Radar waves, like visible light, radio waves, or microwaves, are electromagnetic waves. These waves travel through space with a particular frequency. Radio and radar waves generally have very low frequencies, while microwaves or gamma rays have very high frequencies (visible light is in the middle). Radar seeks to detect distant objects by transmitting electromagnetic waves and then detecting those waves&#8217; reflection off objects.</p><p><em>What Are Electromagnetic Waves?</em></p><p>Electricity and electromagnetic waves are closely related. Electric current is generated when there is a voltage potential across a wire. On the negative end of the battery there is an excess of electrons, and on the positive end there is a dearth of electrons. Electrons nearest the positive end of the wire will drift toward the positive terminal of the battery, and electrons on the negative end will drift outward.</p><p>Now to consider electromagnetic waves again: according to <a href="https://abbey.us.ai/assets/15496">Ampere&#8217;s Law</a>, an electric current generates a magnetic field around itself. An electromagnetic wave is caused by <em>changes</em> in the electric and magnetic fields. So a current that is changing direction a lot (like an alternating current) simultaneously changes the magnetic field around it and generates a wave spreading out from the wire (<em>electro</em> - <em>magnetic</em>). The frequency with which these electrons wiggle back and forth is the frequency of the wave. The wave propagates via photons, which carry information about this frequency.</p><p>When the wave (the photons) hits a conducting body, the electromagnetic wave induces an oscillating current &#8211; effectively the reverse of the process just described. In a material that <em>reflects</em> electromagnetic waves, it means that the wave strikes the body, which generates current, which then re-generates the wave (i.e., re-releases the photons). Metal reflects electromagnetic waves very well.</p><p><em>As a side note: You may ask, &#8220;What exactly is doing the waving, and what does the wave look like in space?&#8221; There is no good answer. Some people say that &#8220;light is both a particle (a photon) and a wave,&#8221; but this description is often unhelpful. A very good source for learning about these phenomena is <a href="https://www.youtube.com/watch?v=bAnnEiAjN6U">Richard Feynman&#8217;s lectures on Quantum Electrodynamics.</a></em></p><p><strong>Foreword</strong></p><p>The foreword of the paper starts defining the scope of &#8220;the physical theory of diffraction&#8221; (the title of the work) and where it differs from prior work. It starts with reference to &#8220;geometric optics,&#8221; an old theory that describes light as rays (straight lines), and its counterpart, the &#8220;geometric theory of diffraction.&#8221; It also describes &#8220;physical optics&#8221;, which uses a more modern wave interpretation of light. This paper is its diffraction theory counterpart.</p><p>The foreword explains how it differs from the previously proposed &#8220;geometric theory of diffraction&#8221;:</p><blockquote><p>In the &#8216;geometric theory of diffraction&#8217; proposed by Keller, [...] the concept of diffraction rays advanced to the forefront. This concept was expressed rather as a physical hypothesis and was not suitable for representing the field in all of space: it was not usable where the formation of the diffraction field takes place (at the caustic, at the boundary of light and shadow, etc.). Here it is impossible to talk about rays, and one must use a wave interpretation. (Page v)</p></blockquote><p>So the old theory of diffraction relied upon an imprecise interpretation of light as things moving in straight lines. However, this interpretation is inadequate when examining a body closely (like at the &#8220;boundary between light and shadow&#8221; &#8211; i.e., at the edge). The new theory relies on a more precise wave interpretation of light.</p><p>Do we care about this difference in accuracy? The foreword explains the use-case:</p><blockquote><p>It is clear that the physical theory of diffraction is a step forward in comparison with physical optics, which in general neglects the additional (edge) currents. The results obtained in this book show that with a given wavelength the physical theory of diffraction gives a better precision than physical optics, and with a given precision <strong>the physical theory of diffraction allows one to advance into the longer wave region and, in particular, to obtain a number of results which are of interest for radar where the ratios of the dimensions of the bodies to the wavelength do not reach such large values as in optics.</strong> In addition, the physical theory of diffraction encompasses a number of interesting phenomena which are entirely foreign to physical optics. Thus, <strong>in a number of cases the additional currents give, not a small correction to the radiation field, but the main contribution to this field</strong> (see especially Chapters IV and V). (Page vi)</p></blockquote><p>So while the old theories may give good-enough results for optical waves (light that you can see traveling around like a ray), this theory remedies significant errors for longer waves (i.e., radar).</p><p><em>What is Diffraction?</em></p><p>You may still be wondering what &#8220;diffraction&#8221; means (which is not defined in the paper). Diffraction occurs when a wave meets an obstacle or an opening. It is the bending of the wave around the obstacle (if it bends a lot, the object might have no &#8220;shadow&#8221;). Think of a rock in a pool of water: waves with a high wavelength will seem to pass around it, while shorter waves will leave a visible trail of still water behind the rock where the waves could not reach. You can refer to an explanatory video <a href="https://www.youtube.com/watch?v=1bHipDSHVG4">here</a>.</p><p>The relevance to this paper (and to radar) &#8211; besides that &#8220;diffraction&#8221; is in the title &#8211; is that the calculations involving &#8220;diffraction currents&#8221; give the correction factors attributable to this theory. In the words of the paper, &#8220;In certain cases, one is obliged to consider the diffraction interaction of the various edges &#8211; that is, the fact that the wave created by one edge and propagated past another edge is diffracted by it (secondary diffraction).&#8221; (Page v)</p><p><strong>Introduction</strong></p><p>Now we enter the paper. Ufimtsev first demonstrates a physical optics approach to understanding a scattered wave (i.e., reflected, but not only straight back). The first equation is:</p><div class="captioned-image-container"><figure><a class="image-link image2" target="_blank" href="https://substackcdn.com/image/fetch/$s_!e879!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Faaf1d3ba-1b03-4e2c-a532-91e00baa4e9c_714x224.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!e879!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Faaf1d3ba-1b03-4e2c-a532-91e00baa4e9c_714x224.png 424w, https://substackcdn.com/image/fetch/$s_!e879!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Faaf1d3ba-1b03-4e2c-a532-91e00baa4e9c_714x224.png 848w, https://substackcdn.com/image/fetch/$s_!e879!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Faaf1d3ba-1b03-4e2c-a532-91e00baa4e9c_714x224.png 1272w, https://substackcdn.com/image/fetch/$s_!e879!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Faaf1d3ba-1b03-4e2c-a532-91e00baa4e9c_714x224.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!e879!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Faaf1d3ba-1b03-4e2c-a532-91e00baa4e9c_714x224.png" width="180" height="56.470588235294116" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/aaf1d3ba-1b03-4e2c-a532-91e00baa4e9c_714x224.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:224,&quot;width&quot;:714,&quot;resizeWidth&quot;:180,&quot;bytes&quot;:25636,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!e879!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Faaf1d3ba-1b03-4e2c-a532-91e00baa4e9c_714x224.png 424w, https://substackcdn.com/image/fetch/$s_!e879!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Faaf1d3ba-1b03-4e2c-a532-91e00baa4e9c_714x224.png 848w, https://substackcdn.com/image/fetch/$s_!e879!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Faaf1d3ba-1b03-4e2c-a532-91e00baa4e9c_714x224.png 1272w, https://substackcdn.com/image/fetch/$s_!e879!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Faaf1d3ba-1b03-4e2c-a532-91e00baa4e9c_714x224.png 1456w" sizes="100vw" loading="lazy"></picture><div></div></div></a></figure></div><p>This equation is a bit strange even to the mathematically adroit. To explain: j is the surface current density, c is the speed of light, n is the &#8220;normal&#8221; vector, and H represents the magnetic field vector. Ufimstev will explain that j0 is the surface current density in the &#8220;uniform&#8221; part (the part not accounting for edge currents, his contribution). The normal vector is just a line that sticks out from the surface, and the notation above means that its cross product is being taken with respect to another line, the line associated with the magnetic field. The cross product simply means that the result is a current (line) flowing along the surface, whose magnitude is largest when the surface is exactly perpendicular to the wave. When the surface is slanted with respect to the magnetic field, the generated current density is lower.</p><p>(A side note: the constant on the left involving the speed of light appears as though it messes up the units, but Ufimstev is using an alternative scheme in which the speed of light has no units. If you looked up this equation elsewhere, the c/2pi constant is removed)</p><p>As promised Ufimstev then gives the equation:</p><div class="captioned-image-container"><figure><a class="image-link image2" target="_blank" href="https://substackcdn.com/image/fetch/$s_!xEq_!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F32762eb7-1dd2-4b4c-883a-736aaf30a885_522x222.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!xEq_!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F32762eb7-1dd2-4b4c-883a-736aaf30a885_522x222.png 424w, https://substackcdn.com/image/fetch/$s_!xEq_!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F32762eb7-1dd2-4b4c-883a-736aaf30a885_522x222.png 848w, https://substackcdn.com/image/fetch/$s_!xEq_!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F32762eb7-1dd2-4b4c-883a-736aaf30a885_522x222.png 1272w, https://substackcdn.com/image/fetch/$s_!xEq_!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F32762eb7-1dd2-4b4c-883a-736aaf30a885_522x222.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!xEq_!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F32762eb7-1dd2-4b4c-883a-736aaf30a885_522x222.png" width="150" height="63.793103448275865" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/32762eb7-1dd2-4b4c-883a-736aaf30a885_522x222.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:222,&quot;width&quot;:522,&quot;resizeWidth&quot;:150,&quot;bytes&quot;:14125,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!xEq_!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F32762eb7-1dd2-4b4c-883a-736aaf30a885_522x222.png 424w, https://substackcdn.com/image/fetch/$s_!xEq_!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F32762eb7-1dd2-4b4c-883a-736aaf30a885_522x222.png 848w, https://substackcdn.com/image/fetch/$s_!xEq_!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F32762eb7-1dd2-4b4c-883a-736aaf30a885_522x222.png 1272w, https://substackcdn.com/image/fetch/$s_!xEq_!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F32762eb7-1dd2-4b4c-883a-736aaf30a885_522x222.png 1456w" sizes="100vw" loading="lazy"></picture><div></div></div></a></figure></div><p>&#8230;where the total surface current density is the sum of the simplistic calculation above (j0) and a more complicated calculation (j1) that matters around the edges of the surface. The simple current density is called the &#8220;uniform part&#8221; and the other one is called the &#8220;non-uniform&#8221; part.</p><p>Why do we care about surface current density again? Recall that an electromagnetic wave is &#8220;reflected&#8221; when it is absorbed by the electrons on a surface (this generates current) and then released again (that is, it emits photons). So the surface current density determines how much the wave gets reflected. Ufimstev notes, &#8220;The scattered field created by the current [see the first equation] is then found by means of Maxwell's equations.&#8221; So once we have the current, we can figure out the generated wave (this is a known procedure covered outside the paper; they are very famous equations).</p><p><strong>Chapter 1: Diffraction by a Wedge</strong></p><p>If you are concerned with calculating the radar cross section of a plane that looks like the F-117 Nighthawk, wedges are very important. The Nighthawk is composed almost entirely of flat surfaces at harsh angles. These are wedges.</p><p>The paper proceeds by recapitulating a rigorous calculation of the scattered wave, which was first found by <a href="https://abbey.us.ai/assets/15909">Arnold Somerfeld</a>. It then uses the &#8220;physical optics approach&#8221; to find j0, the &#8220;uniform part.&#8221; The &#8220;non-uniform&#8221; part is the difference between the two.</p><p>At this point the paper becomes mathematically serious, and I cannot go through it all here.</p><p>&#8212;</p><p><strong>Conclusion</strong></p><p>I hope that the work so far has given you an idea of the contribution of Soviet mathematics to U.S. stealth technology and has introduced you to some of what you can learn on Abbey. And hopefully it&#8217;s inspired some of you to learn more about electricity and magnetism; I had too much fun with this project.</p>]]></content:encoded></item><item><title><![CDATA[A New Way to Learn With Abbey: Auto-Generated Courses and Quizzes]]></title><description><![CDATA[Create quizzes from documents in a few seconds to test yourself. Generate entire courses quickly in collaboration with AI. Browse our collections of high-quality learning content.]]></description><link>https://blog.goodreason.ai/p/a-new-way-to-learn-with-abbey-auto</link><guid isPermaLink="false">https://blog.goodreason.ai/p/a-new-way-to-learn-with-abbey-auto</guid><dc:creator><![CDATA[Gordon Kamer]]></dc:creator><pubDate>Thu, 04 Jan 2024 18:41:15 GMT</pubDate><enclosure url="https://substackcdn.com/image/fetch/$s_!beX1!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff6988385-97ea-46c4-b3b2-6fa69c4ca072_1679x1272.png" length="0" type="image/jpeg"/><content:encoded><![CDATA[<div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!beX1!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff6988385-97ea-46c4-b3b2-6fa69c4ca072_1679x1272.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!beX1!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff6988385-97ea-46c4-b3b2-6fa69c4ca072_1679x1272.png 424w, https://substackcdn.com/image/fetch/$s_!beX1!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff6988385-97ea-46c4-b3b2-6fa69c4ca072_1679x1272.png 848w, https://substackcdn.com/image/fetch/$s_!beX1!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff6988385-97ea-46c4-b3b2-6fa69c4ca072_1679x1272.png 1272w, https://substackcdn.com/image/fetch/$s_!beX1!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff6988385-97ea-46c4-b3b2-6fa69c4ca072_1679x1272.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!beX1!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff6988385-97ea-46c4-b3b2-6fa69c4ca072_1679x1272.png" width="1456" height="1103" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/f6988385-97ea-46c4-b3b2-6fa69c4ca072_1679x1272.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:1103,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:286075,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:true,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!beX1!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff6988385-97ea-46c4-b3b2-6fa69c4ca072_1679x1272.png 424w, https://substackcdn.com/image/fetch/$s_!beX1!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff6988385-97ea-46c4-b3b2-6fa69c4ca072_1679x1272.png 848w, https://substackcdn.com/image/fetch/$s_!beX1!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff6988385-97ea-46c4-b3b2-6fa69c4ca072_1679x1272.png 1272w, https://substackcdn.com/image/fetch/$s_!beX1!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff6988385-97ea-46c4-b3b2-6fa69c4ca072_1679x1272.png 1456w" sizes="100vw" fetchpriority="high"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>I know that for anything I&#8217;m trying to understand, <a href="https://abbey.us.ai/">Abbey</a> is the best place to do it. Suppose I have a textbook chapter or article that I need to know backwards and forwards: not only can I get a quality summary, but I can <strong>auto-generate a quiz in seconds to test myself</strong>. I find that I learn more from the quiz in one minute than I do from reading the article in ten. The things you get wrong tend to stick with you better. Quiz generation is just one new feature we&#8217;re launching on Abbey this week. In fact, it is not even the most important: for anything you want to book-up on, you can quickly make a curriculum for it on Abbey, which links to curated, high-quality sources, like textbooks and academic articles. <strong>The</strong> <strong>course is totally customizable, and created based on your conversation with AI</strong>.&nbsp;</p><p><strong>Curriculum Generation</strong></p><p>Abbey is not pedagogically prescriptive, so it won&#8217;t tell you what you must learn. The best studying follows a personalized structure. Abbey works with you, through natural language, to draft a curriculum based on your background knowledge and the task at hand. The curriculum is then automatically supplemented with sources from Abbey Collections and your uploaded files. For example, a curriculum on basic mechanical engineering might include a unit on joints. There, our permanent joint materials could be supplemented by a research paper you recently read about welding titanium that&#8217;s pertinent to your current project. Then after finishing the unit, Abbey can make a quiz directly from the sources in multiple choice or short answer format to test you on what you studied.&nbsp;</p><p>All sources are chattable, meaning you can compare relevant concepts without getting bogged down by jargon; just ask it to explain a term. You can make a curriculum for any kind of learning goal: get up-to-speed enough on recent research, pick up a totally new subject, or relearn all of high-school biology. And you can share your favorite courses and take courses made by others. This can be especially powerful inside an organization that needs to incorporate new tech and proprietary processes into onboarding/training courses for new hires.</p><p>In the spirit of learning new things, we&#8217;ve created an open Curriculum on AI which goes from the first models and their fundamentals to recent breakthroughs that we&#8217;re all so familiar with. You can check it out <a href="https://abbey.us.ai/assets/2569">here</a>.</p><p><strong>Quizzes</strong></p><p>Abbey's Quizzes enhance curricula by offering concise, targeted assessments for <em>any </em>topic. Quizzes are adaptable, suitable for in-depth course materials or for quickly getting up to speed on a single topic with a combination of multiple-choice and short-answer questions. This makes them ideal for professionals and students looking to go deep into a topic or quickly brush up. With specific concepts in mind you can tailor questions to precise ideas in a document or even create a whole quiz from scratch.&nbsp;</p><p>In a study group or organizational context the sharing functionality of quizzes enhances collaborative learning and ensures uniform understanding and training, which is particularly useful for new technologies and processes. And, it&#8217;s fun to quiz your friends.&nbsp;</p><p><strong>Collections</strong></p><p>Technical professionals and students and anyone else constantly learning often find themselves overwhelmed by the sheer volume of content spread across various sources. High-quality sources are required for real learning. Currently, there is an abundance of information which is dispersed across siloed sources like proprietary software, manufacturing processes, data sheets, national codes, online platforms like YouTube and GitHub, and traditional mediums like textbooks, handbooks, and research journals. To ingest this information requires constant attention shifts which is exhausting and hampers the learning process.&nbsp;</p><p>Collections is our curation of high-quality, accurate, and comprehensive resources for all fields. We aim to cover every major research area to supplement everyone&#8217;s learning experience with clear, detailed, and engaging materials. Biology, chemistry, physics, math, engineering, The National Electrical Code, material data sheets, machining handbooks, etc. As much of Wikipedia as is useful, Want to see something added to Abbey? <a href="https://abbey.us.ai/bug?type=feature&amp;from=%2F">Request a source here.</a></p><p><strong>The Future</strong></p><p>Enhancing collective intelligence is key to advancing humanity, and technical complexity and knowledge gaps slow that advance. By democratizing access to sources of knowledge through Collections and building AI tools like Curriculum that can reason through complex technical processes, we aim to break down these barriers. Whether you&#8217;re an engineer with advanced design challenges or a researcher navigating massive data sets and new nomenclature, everyone needs to tackle harder problems. AI-assisted tools that understand complex technical materials let employees get smarter, record their wisdom, stay on top of new research, and put the pieces of the puzzle together.</p><p>In our technical world continuous learning is a necessity that can be leveraged into an advantage. Onboarding, real-world learning, and a powerful knowledge base could all use the same underlying sources. Abbey equips everyone with the resources they need to grow in industry and role-specific ways. We want to build a future where everyone, regardless of their field or level of expertise, has permanent access to tailored educational resources.&nbsp;</p>]]></content:encoded></item><item><title><![CDATA[Introducing Abbey]]></title><description><![CDATA[An AI platform for sourced knowledge]]></description><link>https://blog.goodreason.ai/p/introducing-abbey</link><guid isPermaLink="false">https://blog.goodreason.ai/p/introducing-abbey</guid><dc:creator><![CDATA[Gordon Kamer]]></dc:creator><pubDate>Wed, 08 Nov 2023 18:00:10 GMT</pubDate><enclosure url="https://substackcdn.com/image/fetch/$s_!mCQC!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5f184b48-6eeb-49fb-9345-38d85a32fb70_1132x819.png" length="0" type="image/jpeg"/><content:encoded><![CDATA[<p></p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!mCQC!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5f184b48-6eeb-49fb-9345-38d85a32fb70_1132x819.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!mCQC!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5f184b48-6eeb-49fb-9345-38d85a32fb70_1132x819.png 424w, https://substackcdn.com/image/fetch/$s_!mCQC!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5f184b48-6eeb-49fb-9345-38d85a32fb70_1132x819.png 848w, https://substackcdn.com/image/fetch/$s_!mCQC!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5f184b48-6eeb-49fb-9345-38d85a32fb70_1132x819.png 1272w, https://substackcdn.com/image/fetch/$s_!mCQC!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5f184b48-6eeb-49fb-9345-38d85a32fb70_1132x819.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!mCQC!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5f184b48-6eeb-49fb-9345-38d85a32fb70_1132x819.png" width="1132" height="819" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/5f184b48-6eeb-49fb-9345-38d85a32fb70_1132x819.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:819,&quot;width&quot;:1132,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:23988,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:true,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!mCQC!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5f184b48-6eeb-49fb-9345-38d85a32fb70_1132x819.png 424w, https://substackcdn.com/image/fetch/$s_!mCQC!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5f184b48-6eeb-49fb-9345-38d85a32fb70_1132x819.png 848w, https://substackcdn.com/image/fetch/$s_!mCQC!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5f184b48-6eeb-49fb-9345-38d85a32fb70_1132x819.png 1272w, https://substackcdn.com/image/fetch/$s_!mCQC!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5f184b48-6eeb-49fb-9345-38d85a32fb70_1132x819.png 1456w" sizes="100vw" fetchpriority="high"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p></p><p>This week we launched <a href="https://abbey.us.ai">Abbey</a>, an AI platform for sourced knowledge. We want Abbey to help you learn anything: pick up new skills, explore liberal arts, or just develop your curiosity. It should not take a degree and years on the job to reach a minimum level of competence in one subject: we hope that Abbey can make everyone an agile learner -- to get up to speed at a job, to get ready for a new career, or just to have fun.&nbsp;</p><p>We are building Abbey with these two principles at heart:</p><ul><li><p>High-quality sources, like academic articles and textbooks, are key to effective learning.</p></li><li><p>Abbey is not pedagogically prescriptive: we want people to use Abbey as they might want to use a library.</p></li></ul><p>We don&#8217;t want to just regurgitate the first page of google search results; search engines are too far gone. We also want your learning to be structured but ultimately self-guided.</p><h3>Who&#8217;s Abbey good for?</h3><ul><li><p><strong>Students</strong> who want to do their reading more effectively, or who want to better understand their course material.</p></li><li><p><strong>Researchers </strong>who want to dive deeper into papers.</p></li><li><p><strong>Professionals</strong> and companies who want to put their own documents into Abbey to help everyone get up to speed or build their skill-set.</p></li></ul><p>What Abbey is today and what we want it to be are two different things. Below we&#8217;ll discuss where we are and where we&#8217;re going.</p><p>It starts with a good way to read your documents but ends with an entirely new way to learn.</p><h3>The Best Way to Read</h3><p>Reading is hard: we miss important content, misunderstand things, and often can&#8217;t find what we know to be in the document. Abbey makes it much easier to read actively. Here&#8217;s how I use Abbey to read:</p><p><strong>Summarize</strong>: every document uploaded to Abbey comes with a summarization tool. The generated summary can take a minute or two to build since it is carefully examining the entirety of your document, no matter how long. When I first try to read a document, I use Abbey to make a summary, and then ask it to shorten the summary.</p><p><strong>Q&amp;A</strong>: After I read the full and shortened summaries, I start asking questions about the parts I find interesting.</p><p><strong>Showing sources</strong>: Each response to a question yields citations to specific passages in the document that I can examine to make sure the response is accurate. I also like to get the full context; this is how I begin to read the actual document.</p><p><strong>Semantic search</strong>: if I want to find more places in the article to read, I search over the document using our semantic search tool. This search is more like Googling over your text than using command-F (or control-F); it&#8217;s more powerful and can understand concepts and synonyms.</p><p>These features are available with every document you upload: Word docs, Excel files, and <strong>even PDF scans</strong> that are not ordinarily searchable. Abbey uses optical character recognition, a technique for recognizing text in images. The same goes for screenshots you upload.</p><h3>Full Control</h3><p>As the complexity of our tools grows, we become less knowledgeable about how they work. We hope that the features and interfaces we provide on Abbey help you better understand how the platform works and ultimately help you use it better. We don&#8217;t believe in keeping functionality from you under the guise that it&#8217;s proprietary knowledge.</p><p><strong>Use your documents:</strong> Users can upload anything they want to Abbey, and it will be there when they get back. You can organize those documents into folders so that you can use Abbey like a file system.</p><p><strong>Choose your model</strong>: we give users a choice between different models, including OpenAI&#8217;s GPT-4 (for subscribers). We hope to add even more models in the future.</p><p><strong>Choose your randomness</strong>: If you have used other chat-bots, you might notice that asking the same question a second time can yield a different response. We give you control over how much randomness is involved in this process: do you always want what AI thinks is the best answer, or do you want to try out different ideas?</p><p><strong>More:</strong> In the future, we want to give users even more control over how Abbey works. For example, we plan to give users control over the specific prompts used to generate summaries and responses.</p><h3>The Future</h3><p>Some things aren&#8217;t available yet on Abbey, but they will be soon. We&#8217;re working hard to make Abbey a much deeper offering than other platforms.</p><p><strong>Content Library:</strong> Textbooks and papers will be available for free and for purchase on Abbey to add to your library.</p><p><strong>Generated Curriculum:</strong> We want learning to be structured and personalized. Using sources you upload as well as our library of content, we want Abbey to outline a set of AI-powered assets to help you achieve your learning goals.</p><h3>Conclusion</h3><p>Much of the current AI discourse centers around its capacity to make our lives <em>easier,</em> but this is shortsighted. Masking and simplifying complex jobs in the hope that AI will handle it for us forces us to imagine a future where nobody has to do anything.</p><p>In reality, AI removes the ceiling on human potential; it allows us to do <em>harder</em> things that we couldn&#8217;t previously do alone. We&#8217;ll need lawyers, engineers, plant operators, critics, teachers, parents, and more to all have a better understanding of our world and the tools in it. We could flounder in pessimism, sure that the jobs will all be gone in ten years. Or, we could bet on ourselves and use AI to raise the floor of human competency.</p><p>This is not to say that Abbey is difficult to work with; it&#8217;s straightforward. The point is that as we iterate, we&#8217;ll work to provide the tools you need to learn the thing you thought you never could. Deep and accurate content libraries, curriculum generation, knowledge assessments &#8211; Abbey is a bicycle for the mind. These things are all on their way, and you can use them however you&#8217;d like. &nbsp;&nbsp;&nbsp;</p>]]></content:encoded></item><item><title><![CDATA[GPTs Don't Use a Database]]></title><description><![CDATA[ChatGPT isn't looking up facts for you.]]></description><link>https://blog.goodreason.ai/p/gpts-dont-use-a-database</link><guid isPermaLink="false">https://blog.goodreason.ai/p/gpts-dont-use-a-database</guid><dc:creator><![CDATA[Gordon Kamer]]></dc:creator><pubDate>Wed, 16 Aug 2023 18:01:09 GMT</pubDate><enclosure url="https://substackcdn.com/image/fetch/$s_!RP2T!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc3392b6a-6e64-42ad-90df-5c686593cfc3_1024x1024.png" length="0" type="image/jpeg"/><content:encoded><![CDATA[<div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!RP2T!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc3392b6a-6e64-42ad-90df-5c686593cfc3_1024x1024.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!RP2T!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc3392b6a-6e64-42ad-90df-5c686593cfc3_1024x1024.png 424w, https://substackcdn.com/image/fetch/$s_!RP2T!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc3392b6a-6e64-42ad-90df-5c686593cfc3_1024x1024.png 848w, https://substackcdn.com/image/fetch/$s_!RP2T!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc3392b6a-6e64-42ad-90df-5c686593cfc3_1024x1024.png 1272w, https://substackcdn.com/image/fetch/$s_!RP2T!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc3392b6a-6e64-42ad-90df-5c686593cfc3_1024x1024.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!RP2T!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc3392b6a-6e64-42ad-90df-5c686593cfc3_1024x1024.png" width="448" height="448" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/c3392b6a-6e64-42ad-90df-5c686593cfc3_1024x1024.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:1024,&quot;width&quot;:1024,&quot;resizeWidth&quot;:448,&quot;bytes&quot;:null,&quot;alt&quot;:&quot;https://cdn.discordapp.com/attachments/1122274633983475712/1141430458698186792/gordon_a_robot_sitting_in_a_room_shuffling_through_papers_books_b30d0348-d2c6-4fc7-8bef-b18b1ad8c682.png&quot;,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:true,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="https://cdn.discordapp.com/attachments/1122274633983475712/1141430458698186792/gordon_a_robot_sitting_in_a_room_shuffling_through_papers_books_b30d0348-d2c6-4fc7-8bef-b18b1ad8c682.png" title="https://cdn.discordapp.com/attachments/1122274633983475712/1141430458698186792/gordon_a_robot_sitting_in_a_room_shuffling_through_papers_books_b30d0348-d2c6-4fc7-8bef-b18b1ad8c682.png" srcset="https://substackcdn.com/image/fetch/$s_!RP2T!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc3392b6a-6e64-42ad-90df-5c686593cfc3_1024x1024.png 424w, https://substackcdn.com/image/fetch/$s_!RP2T!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc3392b6a-6e64-42ad-90df-5c686593cfc3_1024x1024.png 848w, https://substackcdn.com/image/fetch/$s_!RP2T!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc3392b6a-6e64-42ad-90df-5c686593cfc3_1024x1024.png 1272w, https://substackcdn.com/image/fetch/$s_!RP2T!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc3392b6a-6e64-42ad-90df-5c686593cfc3_1024x1024.png 1456w" sizes="100vw" fetchpriority="high"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p><strong>The most common misconception people have about ChatGPT is that when you ask it a question, it is in some way searching a database for an answer and returning it to you. The truth is that ChatGPT, like all language models, has no such database.</strong> Models don&#8217;t typically even use the internet at all. All knowledge that the models have learned is stored (mysteriously) in the model &#8220;parameters&#8221; &#8211; billions of numbers discovered through the training process that have little human interpretation.</p><p>In fact, it is generally unknown how (or whether it is possible) to get back the training dataset from the model&#8217;s parameters. Even worse, researchers often do not know where &#8211; even roughly &#8211; in that over 100 gigabyte file any particular bit of knowledge is stored. Your guess is as good as an expert&#8217;s in determining where a model stores &#8220;Joe Biden is the President of the United States.&#8221;</p><div class="subscription-widget-wrap-editor" data-attrs="{&quot;url&quot;:&quot;https://blog.goodreason.ai/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe&quot;,&quot;language&quot;:&quot;en&quot;}" data-component-name="SubscribeWidgetToDOM"><div class="subscription-widget show-subscribe"><div class="preamble"><p class="cta-caption">Thanks for reading US|AI Blog! Subscribe for free to receive new posts and support my work.</p></div><form class="subscription-widget-subscribe"><input type="email" class="email-input" name="email" placeholder="Type your email&#8230;" tabindex="-1"><input type="submit" class="button primary" value="Subscribe"><div class="fake-input-wrapper"><div class="fake-input"></div><div class="fake-button"></div></div></form></div></div><p>That we designed these models but still have little understanding of how they store knowledge strikes people as impossible. To try and explain how this fact is possible, I&#8217;ll do my best to explain terms like &#8220;training&#8221; and &#8220;parameters&#8221; in the context of the process for creating large language models.</p><p>Like the human brain, we have some intuition about how it might work, but the inner-workings of memory are unknown.</p><p><strong>What are model parameters?</strong></p><p>You have probably seen at least one kind of model before &#8211; the &#8220;line of best fit.&#8221; For example, suppose we were trying to predict home price from square feet. The plot below shows square footage on the x-axis and home-price on the y-axis. Shown is the line of best fit (the linear regression). While the relationship isn&#8217;t perfect, we can make a reasonable guess at the home price based on square feet by using the line&#8217;s equation: Price = 43.10 * (Square Feet) + 3226.44 (recall y=mx+b).</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!MJG_!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4fe126e2-0e4b-4d14-ac04-1ad83ee51ff7_1600x991.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!MJG_!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4fe126e2-0e4b-4d14-ac04-1ad83ee51ff7_1600x991.png 424w, https://substackcdn.com/image/fetch/$s_!MJG_!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4fe126e2-0e4b-4d14-ac04-1ad83ee51ff7_1600x991.png 848w, https://substackcdn.com/image/fetch/$s_!MJG_!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4fe126e2-0e4b-4d14-ac04-1ad83ee51ff7_1600x991.png 1272w, https://substackcdn.com/image/fetch/$s_!MJG_!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4fe126e2-0e4b-4d14-ac04-1ad83ee51ff7_1600x991.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!MJG_!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4fe126e2-0e4b-4d14-ac04-1ad83ee51ff7_1600x991.png" width="1456" height="902" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/4fe126e2-0e4b-4d14-ac04-1ad83ee51ff7_1600x991.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:902,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!MJG_!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4fe126e2-0e4b-4d14-ac04-1ad83ee51ff7_1600x991.png 424w, https://substackcdn.com/image/fetch/$s_!MJG_!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4fe126e2-0e4b-4d14-ac04-1ad83ee51ff7_1600x991.png 848w, https://substackcdn.com/image/fetch/$s_!MJG_!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4fe126e2-0e4b-4d14-ac04-1ad83ee51ff7_1600x991.png 1272w, https://substackcdn.com/image/fetch/$s_!MJG_!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4fe126e2-0e4b-4d14-ac04-1ad83ee51ff7_1600x991.png 1456w" sizes="100vw"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>This predictive model has two parameters &#8211; the numbers 43.10 and 3226.44. The first number tells us that, in the dataset (known home-price and square-feet pairs), an additional square foot raises a home&#8217;s price on average by $43.10.</p><p>A large language model does a similar task, except instead of square feet on the x-axis, the input variable is a <em>numerical representation of text</em>. The y-axis (the thing being predicted) is the <em>next word </em>(or, in AI-jargon, <em>token</em>, which includes things like punctuation or word-fragments). Don&#8217;t get hung up on the idea that words are not numbers: we can switch between the two through a &#8220;tokenization&#8221; and &#8220;embedding&#8221; process, which is not covered here.</p><p>Our linear model involved one multiplication (43.10 * Square Feet) and one addition (... + 3226.44). In a large language model, there are many more additions, subtractions, multiplications, and divisions. The exact formula is not covered here, but rest assured that it is straightforward for a computer to calculate with enough processing power.</p><p><strong>The Objective</strong></p><p>When a language model &#8220;generates&#8221; text, it is repeatedly predicting the next word in the sequence in the same way our linear regression tries to predict home prices.</p><p>Each prediction is added to the input text, and the model again predicts the following word. The process is the same as if you typed out a text on your phone and kept hitting the auto-complete suggestion over and over. Large AI models are just a lot more sophisticated than the model doing auto-complete on your text messages.</p><p>Fundamentally, the language model is just trying to predict the next token. The model is not too dissimilar from our linear regression above, except it has many more than just two parameters that it can add and multiply together. In fact, it could have as many as a trillion.</p><p><strong>Training</strong></p><p>In the above example, the process of determining the two parameters of our model from the training data (the blue X&#8217;s) is called <em>training</em>. For a simple linear model, there are many ways to do the training process.&nbsp;</p><p>One simple way to find our line of best fit would be to try out a bunch of different lines. At each step, we could compare our new random line to a previous line and throw away whichever is worse. After enough tries, we would end up with a reasonably good line.</p><p>Training large language models is not too different. We continually make small (partially random) changes to our parameters and test them against a portion of the dataset. The final result is the combination of parameters that perform our task best once the training budget has been spent.</p><p><strong>Where is the knowledge?</strong></p><p>How is it possible that the model knows so much without referencing any database? The answer is that predicting text requires knowledge, so our next-token prediction engine will predict tokens consistent with the nature of the universe. For example, if we give the model &#8220;The sky is&#8221; and ask for a continuation, the next word might be &#8220;blue&#8221;, which is both factually correct and the most likely next word.&nbsp;</p><p>No one explicitly programmed into the model the fact that the sky is blue. Nevertheless, in order to predict text effectively, that knowledge has to be encoded by the parameters in some way. The result of all those multiplications, additions, and divisions happened to be the right answer. Recovering which parameter(s), out of a trillion, gave the model the correct answer is like finding a needle in a haystack.</p><p><strong>Conclusion</strong></p><p>The best analogy is probably the human brain: you can think of the knowledge it has picked up as what it has memorized &#8211; correctly or incorrectly. For precise figures or obscure facts, language models, like humans, could often benefit from a real reference.</p><div class="subscription-widget-wrap-editor" data-attrs="{&quot;url&quot;:&quot;https://blog.goodreason.ai/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe&quot;,&quot;language&quot;:&quot;en&quot;}" data-component-name="SubscribeWidgetToDOM"><div class="subscription-widget show-subscribe"><div class="preamble"><p class="cta-caption">Thanks for reading US|AI Blog! Subscribe for free to receive new posts and support my work.</p></div><form class="subscription-widget-subscribe"><input type="email" class="email-input" name="email" placeholder="Type your email&#8230;" tabindex="-1"><input type="submit" class="button primary" value="Subscribe"><div class="fake-input-wrapper"><div class="fake-input"></div><div class="fake-button"></div></div></form></div></div>]]></content:encoded></item><item><title><![CDATA[What is actually in GPTs' training data?]]></title><description><![CDATA[Data quality matters, and copyright issues abound.]]></description><link>https://blog.goodreason.ai/p/what-is-actually-in-gpts-training</link><guid isPermaLink="false">https://blog.goodreason.ai/p/what-is-actually-in-gpts-training</guid><dc:creator><![CDATA[Gordon Kamer]]></dc:creator><pubDate>Tue, 18 Jul 2023 00:26:49 GMT</pubDate><enclosure url="https://substackcdn.com/image/fetch/$s_!vLlQ!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F67bf21d7-8dee-4f9b-ade8-850f7a2c3d6e_1024x1024.png" length="0" type="image/jpeg"/><content:encoded><![CDATA[<div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!vLlQ!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F67bf21d7-8dee-4f9b-ade8-850f7a2c3d6e_1024x1024.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!vLlQ!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F67bf21d7-8dee-4f9b-ade8-850f7a2c3d6e_1024x1024.png 424w, https://substackcdn.com/image/fetch/$s_!vLlQ!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F67bf21d7-8dee-4f9b-ade8-850f7a2c3d6e_1024x1024.png 848w, https://substackcdn.com/image/fetch/$s_!vLlQ!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F67bf21d7-8dee-4f9b-ade8-850f7a2c3d6e_1024x1024.png 1272w, https://substackcdn.com/image/fetch/$s_!vLlQ!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F67bf21d7-8dee-4f9b-ade8-850f7a2c3d6e_1024x1024.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!vLlQ!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F67bf21d7-8dee-4f9b-ade8-850f7a2c3d6e_1024x1024.png" width="494" height="494" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/67bf21d7-8dee-4f9b-ade8-850f7a2c3d6e_1024x1024.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:1024,&quot;width&quot;:1024,&quot;resizeWidth&quot;:494,&quot;bytes&quot;:1009338,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:true,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!vLlQ!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F67bf21d7-8dee-4f9b-ade8-850f7a2c3d6e_1024x1024.png 424w, https://substackcdn.com/image/fetch/$s_!vLlQ!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F67bf21d7-8dee-4f9b-ade8-850f7a2c3d6e_1024x1024.png 848w, https://substackcdn.com/image/fetch/$s_!vLlQ!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F67bf21d7-8dee-4f9b-ade8-850f7a2c3d6e_1024x1024.png 1272w, https://substackcdn.com/image/fetch/$s_!vLlQ!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F67bf21d7-8dee-4f9b-ade8-850f7a2c3d6e_1024x1024.png 1456w" sizes="100vw" fetchpriority="high"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>Language models&#8217; training data are the source of all their knowledge. Massive collections of unstructured text are given to the model over the course of training (a months long process). The data influence the parameters of the model, an encoding of all it knows. Once the data are in the model, it is generally not possible to examine them (they are not stored in any database). The model is completely fixed unless the expensive training process resumes.</p><p>The quality of a model depends on the quality of the data: garbage in, garbage out. OpenAI is getting <a href="https://www.sfchronicle.com/bayarea/article/s-f-openai-sued-scraping-data-18180191.php">sued</a> <a href="https://www.sfgate.com/tech/article/sarah-silverman-sues-meta-openai-chatgpt-18193292.php">regularly</a> over what it puts in its training mix. So what actually is being fed to our models?</p><div class="subscription-widget-wrap-editor" data-attrs="{&quot;url&quot;:&quot;https://blog.goodreason.ai/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe&quot;,&quot;language&quot;:&quot;en&quot;}" data-component-name="SubscribeWidgetToDOM"><div class="subscription-widget show-subscribe"><div class="preamble"><p class="cta-caption">Thanks for reading US|AI Blog! Subscribe for free to receive new posts.</p></div><form class="subscription-widget-subscribe"><input type="email" class="email-input" name="email" placeholder="Type your email&#8230;" tabindex="-1"><input type="submit" class="button primary" value="Subscribe"><div class="fake-input-wrapper"><div class="fake-input"></div><div class="fake-button"></div></div></form></div></div><p>Before taking a closer look at all the popular datasets, there are some big takeaways:</p><ul><li><p>The vast majority of data (90%+) come from internet scraping.</p></li><li><p>There is very little non-English data fed to most (American-made) models, often with zero Chinese.</p></li><li><p>High-quality data are sparse, and much of it is copyrighted (litigants have a strong case against AI companies).</p></li><li><p>Even the &#8220;high-quality&#8221; fine-tuning data are not particularly high-quality.</p></li></ul><p>At a glance, you can see some datasets used for popular models below. The information is taken from the papers and blog posts that accompanied those models&#8217; releases.</p><p></p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!5ljt!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F68739ab9-6806-4aa9-b8fe-dc2718d4f2dc_626x829.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!5ljt!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F68739ab9-6806-4aa9-b8fe-dc2718d4f2dc_626x829.png 424w, https://substackcdn.com/image/fetch/$s_!5ljt!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F68739ab9-6806-4aa9-b8fe-dc2718d4f2dc_626x829.png 848w, https://substackcdn.com/image/fetch/$s_!5ljt!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F68739ab9-6806-4aa9-b8fe-dc2718d4f2dc_626x829.png 1272w, https://substackcdn.com/image/fetch/$s_!5ljt!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F68739ab9-6806-4aa9-b8fe-dc2718d4f2dc_626x829.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!5ljt!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F68739ab9-6806-4aa9-b8fe-dc2718d4f2dc_626x829.png" width="626" height="829" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/68739ab9-6806-4aa9-b8fe-dc2718d4f2dc_626x829.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:829,&quot;width&quot;:626,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:81846,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!5ljt!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F68739ab9-6806-4aa9-b8fe-dc2718d4f2dc_626x829.png 424w, https://substackcdn.com/image/fetch/$s_!5ljt!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F68739ab9-6806-4aa9-b8fe-dc2718d4f2dc_626x829.png 848w, https://substackcdn.com/image/fetch/$s_!5ljt!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F68739ab9-6806-4aa9-b8fe-dc2718d4f2dc_626x829.png 1272w, https://substackcdn.com/image/fetch/$s_!5ljt!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F68739ab9-6806-4aa9-b8fe-dc2718d4f2dc_626x829.png 1456w" sizes="100vw"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p></p><p>MPT, Llama, and Falcon are three of the most popular open source models. Their quality appears to be somewhere between GPT-3 and ChatGPT (GPT 3.5), but additional fine-tuning and reinforcement learning might increase their performance to near GPT-4 levels. MPT is MosaicML&#8217;s offering (recently purchased by Databricks); Llama is Meta&#8217;s project; and Falcon is from a relatively unknown research institute in Abu Dhabi, the Technology Innovation Institute (TTI). Falcon performs surprisingly well on <a href="https://huggingface.co/spaces/HuggingFaceH4/open_llm_leaderboard">HuggingFace&#8217;s Open LLM Leaderboard.</a> Each of those models comes in different sizes, the smallest of which are trained on the least number of tokens (tokens can be thought of as words).</p><p>As for GPT-4, OpenAI has intentionally kept private the data the model was trained on. In the <a href="https://arxiv.org/pdf/2303.08774.pdf">GPT-4 technical report</a>, OpenAI says that it was trained on both public and privately licensed data. We can assume that the data mix is comparable to the open models, but there are obviously differences: for one, GPT-4 is capable of reading and writing Chinese.</p><p>One last thing to keep in mind: the goal of a language model is fundamentally to predict text. In reading the examples below, consider how feasible it is to predict what comes next based on the previous text. Could you do it? What would you have to know about the world to guess the next word successfully? Those questions are at the heart of the new AI.</p><p></p><p><strong>Web Scraping</strong></p><p></p><p>From the table, you can see that completely random web content makes up a perhaps surprisingly large proportion of the data. The most popular web-scraping dataset is from Common Crawl, a non-profit (C4 and mC4 are derivatives). The data consist of 40+ languages, but the vast majority are filtered out before training begins (ostensibly so as to not confuse the model). Common Crawl has been collecting data for 7 years monthly or bi-monthly by randomly sampling a subset of the internet. The <a href="https://commoncrawl.org/2023/06/may-june-2023-crawl-archive-now-available/">most recent crawl</a> is about 390 terabytes.</p><p>Common Crawl follows rules against web scraping: if a website has &#8220;CCBot&#8221; in its <code>robots.txt</code> file, the bot will not crawl the site. Of course, less scrupulous companies can simply ignore it.</p><p>To get a feel for the data, below are two examples.</p><p></p><p>Common Crawl Example 1:</p><pre><code><code>"Konstantinos Papachristou The Darkness of a Fairy Tale &#8220;We are living in a world where we are constantly bombarded by advertisements&#8230;&#8221; &#8220;We have lost the true meaning of life&#8230;&#8221; The Darkness of a Fairy Tale is the debut album of saxophone player and composer Konstantinos Papachristou, and describes the journey of a man that receives his wake-up call and begins an esoteric quest in an effort to find the true meaning of life. Konstantinos composed a contemporary jazz concept-album, with an often dark, programmatic intention, that allures the listener to dive deep into his senses. The arrangement retains an unrefined classic jazz sound, and the instrumentation includes a double bass, piano and drums that blend perfectly with the composer&#8217;s fat sounding, tenor saxophone and strong influence of the ECM Jazz label. Been a very creative person, Konstantinos loves music as well as High-End Audio, and he has been involved in various music projects, and has also designed High-End Audio Equipment for consumer and professional use. He supervised the whole production process of the album, and imprinted his personal standpoint in the sound of the album. Darkness of a Fairy Tale retains a live feel with the calmness and focus of a studio album, making it the best of both worlds. Darkness of a Fairy Tale is released in an aesthetic digipak audio-CD featuring cover art from the stunning original painting &#8220;Jours et Nuits No4&#8221; (acryl on canvas, 1996, Vangelis Papachristos) and also on every popular digital music store world wide. You can like Konstantinos&#8217; facebook page here: https://www.facebook.com/KonstPapachristou/ &#932;&#949;&#955;&#949;&#965;&#964;&#945;&#943;&#949;&#962; &#954;&#965;&#954;&#955;&#959;&#966;&#959;&#961;&#943;&#949;&#962; Kaisarias str 101, 18450 Nikaia, Greece"</code></code></pre><p></p><p>Common Crawl Example 2:</p><pre><code><code>"&#2319;&#2360;&#2348;&#2368;&#2319;&#2360; &#2361;&#2367;&#2306;&#2342;&#2368; &#2360;&#2375; &#2332;&#2369;&#2396;&#2375; &#2352;&#2361;&#2367;&#2319;&#2306; Meanings of lesbian in hindi Lesbian themes in Hindi cinema Nazshura 13.08.2018 1 Hindi lesbian video Mezikus 13.08.2018 1 Comments There was a big production house involved and we were launching leads. Their relationship remained ambiguous in the end, but it sparked conversations on lesbian relationships. But there is hope for the future with the younger generation being open-minded in this regard. They have what is most important in a film - a story. He recounts that back in , there was hardly any original content from India. Starring Sunny Leone and Sandhya Mridul, there is a passionate kiss between the two female protagonists of the film. The homosexuality is subtle but obvious and makes the film a pure delight to watch, especially when viewers are under the false impression that the love story is between Arshad-Huma and Madhuri and Naseeruddin. This is a refreshing change from the usual stereotypes seen on the silver screen in India. Both Isha and Amrita received flak for doing such a film as that time, Bollywood and viewers were still opening up to the theme of sexuality, something that was not exploited too much on the silver screen. Siras craves companionship and a human connection, which never comes. However, even with so many movies being produced, there are only a handful that deal with LGBT themes or have significant queer characters in them. Fire is a beautiful story of two women who are married to two brothers who neglect them and end up having a relationship with each other, thus exploring their sexuality. But it is also about ageing, alienation and loneliness so acute that it renders you hollow.</code></code></pre><p></p><p>There are two main things to notice here: first, the data are completely unformatted. All of those ordering cues and side panels on a website show up in the data inline. Note the two truncated quotes at the start of example 1. It is pretty messy. Moreover, there is no filter for explicit themes: the second example discusses lesbianism in Hindi cinema. It is hard to predict what the model will learn.</p><p></p><p><strong>Books</strong></p><p></p><p>The &#8220;books&#8221; datasets contain highly coveted long-form text. If you would like a model to be able to maintain a train of thought over long sequences, you need text with long-term dependencies. That text almost uniquely comes in the form of books: non-fiction and fiction alike. Ideally, we would train on every book in existence (and the reality is not far off; there just aren&#8217;t a lot of them). The widely used books datasets are referred to as &#8220;books1&#8221;, &#8220;books2&#8221;, and &#8220;books3&#8221;.</p><p>Unfortunately, what exactly is in books1 and books2 is fairly mysterious. Some online speculation leans toward books2&#8217;s being <a href="https://libgen.is/">libgen</a>. Books3 contents are better known: it is all of bibliotik, a trove of pirated books (hence the cryptic pseudonym &#8220;books3&#8221;). Companies should be afraid to release models trained on books3: MosaicML has a separate series of models, StoryWriter, trained on books3 and licensed separately (probably still illegally, however). As for OpenAI, they have already been <a href="https://news.bloomberglaw.com/ip-law/openai-hit-with-class-action-over-unprecedented-web-scraping">sued</a>. The question is whether releasing a model trained on some data is akin to releasing the data itself.</p><p>On the other hand, Gutenberg is a repository of public-domain books. There is no worry about copyright infringement.</p><p></p><p><strong>Multi-Lingual Data</strong></p><p></p><p>While the internet has a lot of non-English data, most open models choose to focus on English. The data fed to Llama contains only languages having the Latin or Cyrillic alphabets. If you ask a question to Llama in Chinese, it will sputter (not so for ChatGPT, however). You can see in the table above and in the section on data-processing below which datasets were distilled into English for which models.</p><p>It is not surprising, then, when Chinese labs <a href="https://cevalbenchmark.com/static/leaderboard.html">claim superiority</a> to American models on Chinese-language tasks. In the linked table, ChatGLM-2 is a tiny (6b) model that supposedly outperforms GPT-4. At the bottom of the list is Llama. That it can make any sense in Chinese despite being trained on only the small amount of Chinese text that made it through a filter is surprising in itself - not that it performs so terribly. On the other hand, GPT-4 was probably trained on a lot more Chinese; that it ranks only number 2 is somewhat surprising. However, this fact may reveal that whatever Chinese text GPT-4 was trained on, it was probably limited.</p><p></p><p><strong>Other Pre-Training Data</strong></p><p></p><p>Rounding out the data mixes are typically scholarly or general knowledge texts, such as Wikipedia. These are high quality datasets with important information. Because there is no guarantee the model will learn facts present in the data upon just one viewing, these datasets are often trained over multiple times. Llama samples everything from Wikipedia twice on average.</p><p>ArXiv (pronounced &#8220;archive&#8221;) is a repository of academic papers from STEM fields. In Computer Science, it is popular to release a paper on ArXiv first as a pre-print. It is not surprising that the computer scientists who train models make sure that their own papers are included in the data.</p><p>Semantic Scholar is a more interdisciplinary platform, included in a few models. </p><p>Falcon goes above and beyond with regard to academic data: TII explicitly mentions training on PubMed and other repositories.</p><p>Many models are trained on code as well as text. This code mostly comes from GitHub, an online repository of open source software.</p><p>Lastly, companies typically add Stack Exchange data to the mix as well. Stack Exchange is a site of questions and answers. Stack Overflow is an analogous site just for code. This data is considered high-quality, especially for models designed to become chat bots. In the pre-processing step, Llama sorts answers by score (highest to lowest). See an example below from the Stack Exchange dataset available through Red Pajama.</p><pre><code><code>"Q: Is it safe to always leave a Macbook Pro on? I have an external monitor, keyboard, and mouse connected to my Macbook Pro. I have the lid closed and have it running. I run it as if it were a desktop PC, always on. I only shut off the LCD attached to it. Is it safe to always have the Macbook Pro on and running? About every other week I let the battery drain almost completely. I also have a Belkin laptop cooling stand running underneath it. Is there anything else I can do to safely have my MBP running most of the time? Update: Just wanted to add that I have the newer 2010 Macbook Pro. A: Sure The current uptime on my MacBook is 21 days. One thing to note: * *I wouldn't drain the battery like that. Lithium ion batteries don't work like NiCa batteries with memory. If you have it continuously plugged in, charge to 60% and then remove the battery. A: Technically, yes, but from a long-term maintenance standpoint, I would reccommend at least rebooting once a week. I reboot all my Macs first thing Monday morning, after the Sunday night system maintenance runs, and I never have problems with crashes, memory and the like (unless, of course I do something stupid while programming). Even Mac OS X needs a quick refresh every now and then. A laptop is really not that much different from a desktop computer; it's still a hard drive, ram, cpu, keyboard, and mouse just in a compact form factor. Anything you would do with a desktop machine, do with your laptop. A: From a software standpoint, it's perfectly fine to leave your Mac on continuously. It can even be considered recommended, since Mac OS X has a number of optimization scripts that run on a daily, weekly, monthly basis, but can only do so if the computer is on. From a hardware standpoint, it depends on your model. If you have an older (pre-2008) MacBook with a removable battery, leaving it on and plugged in continuously will kill the battery quick. In that case, you should remove the battery from the machine, only putting it back in occasionally, charging it to around 60%, to keep it in good working order. The new MacBooks with non-removable batteries are designed to be able to be left plugged in constantly. The battery has circuitry to optimize the charge cycles so that the life of the battery is not adversely degraded over time (more than just normal use causes). "</code></code></pre><p></p><p><strong>Data Processing</strong></p><p></p><p>The most popular method for data processing is called the CCNet pipeline. It involves first de-duplicating the text and then applying a small language model to compare the given text to known high-quality data. If they seem similar according to the model, then the data are used. This pipeline is used in the RedPajama datasets, which were in turn used by MPT. The CCNet pipeline or something like it is generally applied to every new language model.</p><p>There are also a variety of heuristics used to prune the non-web-scraping datasets. For example, here are the criteria used by MPT to filter ArXiv papers:</p><ul><li><p>Is open access</p></li><li><p>Has title and abstract</p></li><li><p>Paper is in English</p></li><li><p>&gt;=500 words and &gt;=5 paragraphs</p></li><li><p>Paper was published after 1970 and before 2022-12-01</p></li><li><p>Most frequent word in the paper consists of alphabetic characters only, and it appears in less than 7.5% of the document.</p></li></ul><p>There is a lot of junk on the internet. Sometimes even the most basic rules make a difference.</p><p></p><p><strong>Fine-Tuning Data</strong></p><p></p><p>Fine-tuning data is what turns a regular language model into a chat bot. Applying this data follows the pre-training step, which involves all the data above. The data include question/answer pairs, usually <a href="https://www.theverge.com/features/23764584/ai-artificial-intelligence-data-notation-labor-scale-surge-remotasks-openai-chatbots">hand-written by people</a> in the Philippines or other developing countries. In the case of the Dolly-RLHF dataset from Databricks, the data were collected from &#8220;thousands of Databricks employees [and consists of] several behavioral categories.&#8221; Another popular dataset is the Anthropic &#8220;helpful and harmless &#8221; dataset (used in MPT&#8217;s chat variant). It includes for some of the examples annotations regarding the answers&#8217; helpfulness and harmlessness.</p><p>These datasets are typically very small. The main Dolly RLHF dataset is 15k examples: enough to fit on a floppy disk (compressed, probably).</p><p>Since people are being paid to generate the data, you might expect high-quality text. However, it is not clear if the story is as good as it sounds. Here are some typical examples from the Dolly RLHF dataset:</p><pre><code>Q: "Why are In-N-Out cheeseburgers so good?"
A: "In-N-Out cheeseburgers are seen as good for a few reasons. The meat is never frozen and tight quality controls at their processing plants insure a consistent and high quality meat patty. The incentive structure for the In-n-out employee results in workers staying at the fast food chain longer with the manager frequently in charge of employees and the actual cooking of the burger patties. This method is opposite of competing fast food restaurants who assign the burger cooking to entry level employees and customer management to higher levels. In-n-out maintains a smaller menu focusing on a few key items instead of an expansive or seasonally themed menu making it easier to maintain high standards of quality. Lastly their slow growth and enthusiastic fans provide a positive atmosphere lending itself to people thinking more highly of In-n-out where without this influence might be neutral on the brand."</code></pre><pre><code>Q: "What should be your day to day creator device ?"
A: "1. Iphone pro max 2. DJI OSMO pocket 2 DJI is such an awesome device to shoot as it has it's gimble and easy to operate. The video quality is actually nice"</code></pre><p>Both contain time-period specific brands (the first reading more like an ad for In-N-Out than anything, and who knows if it&#8217;s true). We would rather not throw away our data once a new iPhone comes out, yet the second example depends on the relevancy of the &#8220;Iphone pro max 2&#8221;. Moreover, I did not realize that a &#8220;day to day creator device&#8221; was a common product (it is slightly upsetting that such a question even makes sense). Anyway, now I know that the DJI OSMO pocket 2 DJI is &#8220;gimble [sic] and easy to operate.&#8221;</p><p>When handcrafting the data that will become our overlords&#8217; personality, you might expect a team of great experts carefully tinkering on a robotic mind. Instead, some Databricks employee was just trying to meet a quota.</p><p></p><p><strong>Conclusion</strong></p><p></p><p>There is plenty of room at the bottom. Data quality has a clear impact on model quality. Still, what we put into our models can range from genius (ArXiv papers) to dumb (Dolly-RLHF) to simply malformed (web scrapes). Moreover, the rights to the highest quality data are still completely up in the air. Companies should focus on what is in the data, not just how much.</p><p></p><p><em>Gordon Kamer is the founder of U.S. Artificial Intelligence Inc. For comments or inquiries about the company, email gordon@us.ai.</em></p><p></p><div class="subscription-widget-wrap-editor" data-attrs="{&quot;url&quot;:&quot;https://blog.goodreason.ai/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe&quot;,&quot;language&quot;:&quot;en&quot;}" data-component-name="SubscribeWidgetToDOM"><div class="subscription-widget show-subscribe"><div class="preamble"><p class="cta-caption">Thanks for reading US|AI Blog! Subscribe for free to receive new posts and support my work.</p></div><form class="subscription-widget-subscribe"><input type="email" class="email-input" name="email" placeholder="Type your email&#8230;" tabindex="-1"><input type="submit" class="button primary" value="Subscribe"><div class="fake-input-wrapper"><div class="fake-input"></div><div class="fake-button"></div></div></form></div></div>]]></content:encoded></item></channel></rss>