Bitcoin's Pullback Could Be Your Gain

More than 70 cryptos have recently outperformed Bitcoin and it's not the first time. During crypto's last bull market the top 100 coins NOT named Bitcoin went up by 174%. Now the signs show that it's happening again. This could be a chance to strike gold in some far corners of the crypto market.

AI 'gold rush' for chatbot training data could run out of human-written text

MATT O'BRIEN
June 06, 2024

Artificial intelligence systems like ChatGPT could soon run out of what keeps making them smarter -- the tens of trillions of words people have written and shared online.

A new study released Thursday by research group Epoch AI projects that tech companies will exhaust the supply of publicly available training data for AI language models by roughly the turn of the decade -- sometime between 2026 and 2032.

Comparing it to a "literal gold rush" that depletes finite natural resources, Tamay Besiroglu, an author of the study, said the AI field might face challenges in maintaining its current pace of progress once it drains the reserves of human-generated writing.

In the short term, tech companies like ChatGPT-maker OpenAI and Google are racing to secure and sometimes pay for high-quality data sources to train their AI large language models - for instance, by signing deals to tap into the steady flow of sentences coming out of Reddit forums and news media outlets.

In the longer term, there won't be enough new blogs, news articles and social media commentary to sustain the current trajectory of AI development, putting pressure on companies to tap into sensitive data now considered private -- such as emails or text messages -- or relying on less-reliable "synthetic data" spit out by the chatbots themselves.

"There is a serious bottleneck here," Besiroglu said. "If you start hitting those constraints about how much data you have, then you can't really scale up your models efficiently anymore. And scaling up models has been probably the most important way of expanding their capabilities and improving the quality of their output."

The researchers first made their projections two years ago -- shortly before ChatGPT's debut -- in a working paper that forecast a more imminent 2026 cutoff of high-quality text data. Much has changed since then, including new techniques that enabled AI researchers to make better use of the data they already have and sometimes "overtrain" on the same sources multiple times.

But there are limits, and after further research, Epoch now foresees running out of public text data sometime in the next two to eight years.

The team's latest study is peer-reviewed and due to be presented at this summer's International Conference on Machine Learning in Vienna, Austria. Epoch is a nonprofit institute hosted by San Francisco-based Rethink Priorities and funded by proponents of effective altruism -- a philanthropic movement that has poured money into mitigating AI's worst-case risks.

Besiroglu said AI researchers realized more than a decade ago that aggressively expanding two key ingredients -- computing power and vast stores of internet data -- could significantly improve the performance of AI systems.

The amount of text data fed into AI language models has been growing about 2.5 times per year, while computing has grown about 4 times per year, according to the Epoch study. Facebook parent company Meta Platforms recently claimed the largest version of their upcoming Llama 3 model -- which has not yet been released -- has been trained on up to 15 trillion tokens, each of which can represent a piece of a word.

But how much it's worth worrying about the data bottleneck is debatable.

"I think it's important to keep in mind that we don't necessarily need to train larger and larger models," said Nicolas Papernot, an assistant professor of computer engineering at the University of Toronto and researcher at the nonprofit Vector Institute for Artificial Intelligence.

Papernot, who was not involved in the Epoch study, said building more skilled AI systems can also come from training models that are more specialized for specific tasks. But he has concerns about training generative AI systems on the same outputs they're producing, leading to degraded performance known as "model collapse."

Training on AI-generated data is "like what happens when you photocopy a piece of paper and then you photocopy the photocopy. You lose some of the information," Papernot said. Not only that, but Papernot's research has also found it can further encode the mistakes, bias and unfairness that's already baked into the information ecosystem.

If real human-crafted sentences remain a critical AI data source, those who are stewards of the most sought-after troves -- websites like Reddit and Wikipedia, as well as news and book publishers -- have been forced to think hard about how they're being used.

"Maybe you don't lop off the tops of every mountain," jokes Selena Deckelmann, chief product and technology officer at the Wikimedia Foundation, which runs Wikipedia. "It's an interesting problem right now that we're having natural resource conversations about human-created data. I shouldn't laugh about it, but I do find it kind of amazing."

While some have sought to close off their data from AI training -- often after it's already been taken without compensation -- Wikipedia has placed few restrictions on how AI companies use its volunteer-written entries. Still, Deckelmann said she hopes there continue to be incentives for people to keep contributing, especially as a flood of cheap and automatically generated "garbage content" starts polluting the internet.

AI companies should be "concerned about how human-generated content continues to exist and continues to be accessible," she said.

From the perspective of AI developers, Epoch's study says paying millions of humans to generate the text that AI models will need "is unlikely to be an economical way" to drive better technical performance.

As OpenAI begins work on training the next generation of its GPT large language models, CEO Sam Altman told the audience at a United Nations event last month that the company has already experimented with "generating lots of synthetic data" for training.

"I think what you need is high-quality data. There is low-quality synthetic data. There's low-quality human data," Altman said. But he also expressed reservations about relying too heavily on synthetic data over other technical methods to improve AI models.

"There'd be something very strange if the best way to train a model was to just generate, like, a quadrillion tokens of synthetic data and feed that back in," Altman said. "Somehow that seems inefficient."

------------

The Associated Press and OpenAI have a licensing and technology agreement that allows OpenAI access to part of AP's text archives.

Continue Reading...

Popular

White House's 50-year mortgage proposal has one notable benefit but a number of drawbacks

NEW YORK (AP) — The White House says it is considering backing a 50-year mortgage to help alleviate the home affordability crisis in the country. But the announcement drew immediate criticism from policymakers, social media and economists, who said a 50-year mortgage would do little to resolve other core problems in the housing market, such as a lack of supply and high interest rates.

If You Hold Any Dollars in Your Bank Account, Read This... - Ad

Strange events are unfolding in the global financial system. A monetary reset dubbed the "Mar-a-Lago Accord" is quietly in motion, and the financial elite are already taking protective action. If history is any guide, you could lose up to 40% of your wealth in the next two years. Move your money before it's too late.

Trump Threatens Air Traffic Controllers Amid Shutdown Chaos; Pete Buttigieg Says He 'Wouldn't Last Five Minutes' in Their Job

President Donald Trump has demanded that all air traffic controllers return to work as the nation's aviation system endured another wave of mass flight cancellations, caused by staffing shortages due to the prolonged government shutdown.

Trump Triggered 70% Gains Overnight -- This Rare Earths Stock Could Be Next - Ad

Trump's turning tiny mining stocks into overnight fortunes... and this little-known rare earths miner could be his next billion-dollar BUY. If Trump cuts a deal you could see a 50% to 200% pop overnight. But you must act before the next deal hits the wire.

XRP Jumps 10% In A Week As First-Ever Spot ETF Eyes Thursday Launch

XRP (CRYPTO: XRP) surged 10% over the past seven days amid mounting anticipation for the first-ever XRP ETF, which could launch as early as this week.

This Company Could Challenge NVIDIA's Reign - Ad

This new chip can run at the speed of light and it's changing the game. "TF3" could replace silicon entirely and one American company is producing it commercially. Clients already include NASA and top medical research institutions. It's still under the radar - and that's the opportunity.

Movie Review: Time has outrun this 'Running Man'

It’s always interesting when time overtakes the dystopias of the past. In 1982 novel “The Running Man,” the United States has fallen into a totalitarian state, divided between haves and have-nots, where all movements can be surveilled and realistic video propaganda is easily generated. King’s book was set in the year 2025.

Weiss Gold Veteran Makes Shocking New Call - Ad

Weiss expert Sean Brodrick went out on a limb last year and declared a historic event would send the yellow metal to $3,150. People laughed at him at the time, but he was off by just two days. Now, Sean has a shocking new prediction for gold ... and reveals a little-known way to get ahead of this bull market.

Piper Sandler Set For Best-In-Class Growth As Banking Cycles Turn: Analyst

Piper Sandler (PIPR) upgraded to Buy by Goldman Sachs with a target price of $386, projecting an 18% upside.

Elon Musk Says Tesla, xAI Are 'Trending Towards Convergence' In Some Ways

Elon Musk's companies could potentially merge in the future as he continues to integrate AI into his ventures.

Legally "Skim" $6,361 Into Your Account? - Ad

A former hedge fund manager is now sharing his "Skim Codes" with regular people. They're not stocks. They're not crypto. They're 18-character codes designed to profit from recent market conditions. All you have to do is punch them into an ordinary brokerage account. 84% of these codes have given people the chance to generate cash payouts so far... and his next code is going out any day now.

Air travelers face frustration as FAA's further drop in flights takes effect

Air travelers could face as busy U.S. airports need to meet a higher Federal Aviation Administration target for reducing flights Tuesday after already canceling thousands to scale back demands on the nation’s aviation system during the .

Barrick's Breakup Rumors, North America Versus The World

Barrick Mining (NYSE: B) may split into two companies, one focused on North America and another on Africa and Asia.

Trump Signs Law to Launch Dollar 2.0 - Ad

Trump just signed law S.1582, unleashing the biggest money shift in 100+ years. For the first time since 1913, private firms - not the Fed - can mint a "Dollar 2.0." Treasury says it could drain $6.6T from banks and pay 10X current savings rates. Early investors in minting firms could see 40X returns by 2032.

Dogecoin Fakes A Rally Then Dumps 3%—But Why?

Dogecoin (CRYPTO: DOGE) fell close to 3% on Tuesday, as large holders shifted roughly $32 million worth of DOGE to exchanges, putting sellers back in control.

Elon's $25 Trillion Confession - Ad

Elon Musk: "Tesla will become a $25 trillion company." That would make Tesla 8x bigger than Apple today. How is that possible? He admits it's all thanks to this one AI breakthrough that will take AI out of our computer screens and manifest a 250x boom here in the real world.

Everyday volunteers are providing stopgap services during the shutdown in a show of community power

NEW YORK (AP) — It started with a late October meeting between a lifestyle entrepreneur, a marketing professional, a restaurant owner and a social worker at a brewery in the Florida panhandle. Within hours, Pensacola Grocery Buddies was born.

Gen Z Takes To 'Income Stacking' As One Pay Check Falls Short

Gen Zers are turning to income stacking to secure their financial future as they fear a single paycheck won't be enough. AI and broken social contract fuel the shift. Side hustles becoming core of young careers.

The Market Just Crossed a Dangerous Line - Ad

The man who predicted the 2008 crash and 2020 says today's soaring markets are NOT a bubble - they're something far stranger and more dangerous. He says it's about to change everything you know about money.

These 8 Democrats voted with Republicans on the government shutdown deal. Here's how they explain it

WASHINGTON (AP) — The Democratic senators — eight in total — faced almost instant blowback from members of their own party as to allow the Senate to move forward on that would reopen the government.

Why Is Occidental Petroleum Stock Gaining Tuesday?

Analysts highlight OXY's robust earnings, production guidance, and expanded resource base as key growth drivers.

Investing Legend Hints the End May Be Near for These 3 Iconic Stocks - Ad

Futurist Eric Fry say Amazon, Tesla and Nvidia are all on the verge of major disruption. To help protect anyone with money invested in them, he's sharing three exciting stocks to replace them with. He gives away the names and tickers completely free in his brand-new "Sell This, Buy That" broadcast.

Trump-Pardoned Ponzi Schemer Faces 37-Year Sentence For $44 Million COVID Scam

A previously convicted Ponzi schemer who had received a pardon from President Trump is heading back to prison due to involvement in a new fraudulent scheme.

Trump's $2,000 Tariff Dividend Stumbles On Math: Cost Far Exceeds Revenue

Trump's $2,000 tariff rebate plan faces a major math gap, with estimated costs up to $606.8B—far exceeding tariff revenues in 2025 and 2026, says the Tax Foundation.

Bitcoin's Pullback Could Be Your Gain - Ad

More than 70 cryptos have recently outperformed Bitcoin and it's not the first time. During crypto's last bull market the top 100 coins NOT named Bitcoin went up by 174%. Now the signs show that it's happening again. This could be a chance to strike gold in some far corners of the crypto market.

JPMorgan Forecasts Bitcoin Bottom, Anticipates $28.3 Trillion Challenge To Gold By 2026

Analysts at JPMorgan have pinpointed the lowest point of the ongoing Bitcoin (CRYPTO: BTC) price fall and also projected a substantial chall

Wall Street Roars Back As Rate-Cut Odds Surge — This Week In Markets

Stocks rallied on rising Fed rate cut bets for December, with Alphabet hitting $4T and health care, auto sectors showing renewed strength.

If You Hold Any Dollars in Your Bank Account, Read This... - Ad

Strange events are unfolding in the global financial system. A monetary reset dubbed the "Mar-a-Lago Accord" is quietly in motion, and the financial elite are already taking protective action. If history is any guide, you could lose up to 40% of your wealth in the next two years. Move your money before it's too late.

An archaeologist is racing to preserve Sudan's heritage as war threatens to erase its cultural past

PARIS (AP) — In a dimly lit office in a corner of the French National Institute for Art History, Sudanese archaeologist Shadia Abdrabo studies a photograph of pottery made in her country around 7,000 B.C. She carefully types a description of the Neolithic artifact into a spreadsheet.

Trump's pardon of ex-Honduran president Hernández injects wild card into election

TEGUCIGALPA, Honduras (AP) — The day before Honduras , suddenly the main topics of conversation here shifted from domestic matters to and the former Honduran president he had pardoned.

Trump Triggered 70% Gains Overnight -- This Rare Earths Stock Could Be Next - Ad

Trump's turning tiny mining stocks into overnight fortunes... and this little-known rare earths miner could be his next billion-dollar BUY. If Trump cuts a deal you could see a 50% to 200% pop overnight. But you must act before the next deal hits the wire.

Trump May Formally Offer Putin Control of Occupied Ukrainian Land in Proposed Peace Deal

The U.S. is reportedly prepared to formally acknowledge Russia's hold over Crimea and parts of eastern and southern Ukraine as part of a proposed agreement to end the war.

Paul Krugman Warns AI Rallies Driven By Rate-Cut Hopes Are 'Dead Cat Bounces' — Says It 'Bears an Unmistakable Resemblance' To The Dot-Com Era

Economist Paul Krugman is drawing sharp parallels between the current state of the AI trade and the final years of the dot-com boom in the 1990s, while warning that investors might be misreading the Federal Reserve's recent signals and actions.

This Company Could Challenge NVIDIA's Reign - Ad

This new chip can run at the speed of light and it's changing the game. "TF3" could replace silicon entirely and one American company is producing it commercially. Clients already include NASA and top medical research institutions. It's still under the radar - and that's the opportunity.

Trending Now

Information, charts or examples are for illustration and educational purposes only and not for individualized investment management This message contains commercial elements, such as advertising. We only send these offers to those who have opted in to our newsletter. Past performance is not indicative of future results. For these reasons we strongly suggest trading in a DEMO/Simulated account. The information provided by us is for educational and informational purposes only. We make no representations or warranties concerning the products, practices or procedures of any company or entity mentioned or recommended and have not determined if the statements and opinions of the advertiser are accurate, correct or truthful. If you use, act upon or make decisions in reliance on information contained or any external source linked within it, you do so at your own peril and agree to hold us, our officers, directors, shareholders, affiliates and agents without fault.

Copyright markethundred.com
Privacy Policy | Terms of Service