Researchers Turn Web Blather to Books

fptechno • May 26, 2007, 12:00:00 IST

A few simple keystrokes may soon turn blather into books.

Researchers at Carnegie Mellon University have discovered a way to enlist people across the globe to help digitize books every time they solve the simple distorted word puzzles commonly used to register at Web sites or buy things online.

The word puzzles are known as CAPTCHAs, short for “completely automated public Turing test to tell computers and humans apart.” Computers can’t decipher the twisted letters and numbers, which ensures that real people, not automated programs, are using the Web sites.

Researchers estimate that about 60 million of those nonsensical jumbles are solved every day around the world, each taking an average of about 10 seconds to decipher and type in.

Instead of having people waste time typing in random letters and numbers, Carnegie Mellon researchers have come up with a way for them to type in snippets of books. That puts their time to good use, confirms they’re not machines and helps speed up the process of getting searchable texts online.

“Humanity is wasting 150,000 hours every day on these,” said Luis von Ahn, an assistant professor of computer science at Carnegie Mellon. He helped develop the CAPTCHAs about seven years ago. “Is there any way in which we can use this human time for something good for humanity, do 10 seconds of useful work for humanity?”
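As a rough check on these figures, the researchers' estimates (about 60 million CAPTCHAs a day at about 10 seconds each) can be multiplied out. The sketch below is only illustrative arithmetic; the constant and function names are hypothetical, and both inputs are approximations, so the result should be read as an order of magnitude rather than an exact count.

```python
# Rough arithmetic using the estimates quoted in the article.
CAPTCHAS_PER_DAY = 60_000_000   # "about 60 million" solved every day
SECONDS_PER_CAPTCHA = 10        # "about 10 seconds each"


def hours_spent_per_day(captchas: int, seconds_each: int) -> float:
    """Convert a daily CAPTCHA count and per-CAPTCHA time into hours."""
    return captchas * seconds_each / 3600


total_hours = hours_spent_per_day(CAPTCHAS_PER_DAY, SECONDS_PER_CAPTCHA)
print(round(total_hours))  # 166667
```

The product comes to roughly 167,000 hours a day, in the same range as the 150,000 hours von Ahn cites.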

Many large projects are under way to digitize books and put them online, mostly by scanning pages so that people can “page through” the books on screen. In some cases, optical character recognition, or OCR, is being used to make the texts searchable.

But von Ahn said OCR doesn’t always work on text that is older, faded or distorted. In those cases, often the only way to digitize the works is to manually type them into a computer.

Von Ahn is working with the Internet Archive, which runs several book-scanning projects, to use CAPTCHAs for this instead. The Internet Archive scans 12,000 books a month and sends von Ahn hundreds of thousands of image files that the computer doesn’t recognize. Those files are downloaded onto von Ahn’s server and split up into single words that can be used as CAPTCHAs at sites all over the Internet.

If enough users decipher the CAPTCHAs in the same way, the computer will recognize that as the correct answer.
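The agreement step described above can be sketched as a simple vote over the answers typed for one word image. This is a minimal illustration, not the project's actual method: the function name and the `min_votes` and `min_share` thresholds are assumptions, since the article does not say how many matching answers count as "enough."

```python
from collections import Counter
from typing import Optional


def consensus(answers: list[str], min_votes: int = 3,
              min_share: float = 0.75) -> Optional[str]:
    """Return the transcription most users agree on, or None if unclear.

    min_votes and min_share are hypothetical thresholds chosen for
    illustration only.
    """
    # Ignore blank answers and differences in case or surrounding spaces.
    normalized = [a.strip().lower() for a in answers if a.strip()]
    if len(normalized) < min_votes:
        return None  # too few answers to trust any of them
    word, count = Counter(normalized).most_common(1)[0]
    # Accept the top answer only if a large enough share of users typed it.
    return word if count / len(normalized) >= min_share else None


print(consensus(["barn", "Barn ", "barn", "bam"]))  # barn
print(consensus(["barn", "bam"]))                   # None
```

In the first call three of four users agree, which meets the assumed 75 percent threshold; in the second there are too few answers to decide.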

“If we can correct these books so that they are really in good shape, then you can go and use these books in other type devices more easily,” such as handheld computers or programs for reading to the blind, said Brewster Kahle, co-founder of the Internet Archive.

Von Ahn approached the Internet Archive to get help in developing the new system, but it has not been put into use yet. In theory, von Ahn said, the new book-based CAPTCHAs could be used in place of any CAPTCHA currently on the Web.

The project, named reCAPTCHA, is one of many projects that enlist computer users from the community to help out. For example, Cloudmark Inc. uses its base of users to judge what is spam and what isn’t. News aggregation sites like Digg Inc.’s digg.com and Time Warner Inc.’s Netscape.com ask visitors to recommend and vote on items to go on top.

For von Ahn’s project, Intel Corp. donated equipment and the work was sponsored by the MacArthur Foundation, which awarded von Ahn a “genius grant” last year.

Kahle, whose Internet Archive has about 200,000 books currently online, is working with libraries in three countries to digitize their books. Kahle said von Ahn’s project is “harnessing human power in exactly the right way.”

“It’s definitely a barn-raising to try to build the great library,” Kahle said.
