Be careful what you tweet, it could end up in the US national archives



WASHINGTON // The Library of Congress, repository of the world's largest collection of books, has set for itself the enormous task of archiving something less weighty and far more ephemeral - billions of tweets.

The venerable US institution is assembling all of the 400 million tweets sent by Americans each day, in the belief that each of the mini-messages reflect a small but important part of the national narrative.

"An element of our mission at the Library of Congress is to collect the story of America, and to acquire collections that will have research value," according to Gayle Osterberg, the director of communications at the library.

The Library of Congress, located off the National Mall in Washington, houses millions of hard copy books and historic documents, and its online archives amass millions of additional works produced by Americans for more than two centuries.

Now it wants to be keeper of the nation's brief internet messages as well: Twitter in April 2010 inked a deal with the library, giving it access to tweets dating back to the company's inception in 2006.

Collecting the 140-character micro-missives, said Ms Osterberg, is in keeping with the library's main goal "to collect the story of America and to acquire collections that will have research value".

One major challenge to the Library, however, is storing the messages from the popular social messaging site, which now number 170 billion. Twitter last month said the number of active users on the messaging platform has topped 200 million, most of whom are in the United States.

Tweets that have been deleted or that are locked will not be among those gathered by the Library of Congress.

Among the messages to be preserved for posterity are the first-ever tweets sent by one of the company's founders, Jack Dorsey.

Also saved for all time is a famous tweet sent by President Barack Obama after his historic November 2008 victory to claim the White House in his first term.

"We just made history. All of this happened because you gave your time, talent and passion. All of this happened because of you. Thanks," read the micro-message from the tech-savvy US president.

Unlike traditional bound books or even digital web pages, the real challenge of preserving tweets is keeping up with their number, which has continued to grow almost exponentially.

There were 140 million tweets sent each day in February 2011, but more than three times as many - about a half billion - by October 2012.

The Library of Congress's tweets are being stored by Gnip, a social-media aggregation company headquartered in Boulder, Colorado, which has put more than 133,000 gigabytes of storage space available.

Gnip says it is a particular challenge to gather tweets during "peak" times, such as news event watched the world over like the Japanese tsunami in March 2011, which generated many thousand tweets per second.

It has proven to be a Herculean challenge for Gnip to make tweets accessible to all those who wish to view them.

So far it has been unable to meet the demands of researchers worldwide who hope to access the archive. Even a search among the first four years of tweets, from 2006 to 2010, could take about 24 hours.

"It is clear that technology to allow for scholarship access to large data sets is lagging behind technology for creating and distributing such data," said a recent white paper published by the Library of Congress.

"This is an inadequate situation," the Library concluded, calling the massive archiving project "prohibitively costly".

Yet Lee Humphreys, a professor of communication at Cornell University in New York, said that the brief online messages can reveal volumes "about the culture where they were produced".

Disturbing facts and figures

51% of parents in the UAE feel like they are failing within the first year of parenthood

57% vs 43% is the number of mothers versus the number of fathers who feel they’re failing

28% of parents believe social media adds to the pressure they feel to be perfect

55% of parents cannot relate to parenting images on social media

67% of parents wish there were more honest representations of parenting on social media

53% of parents admit they put on a brave face rather than being honest due to fear of judgment

Source: YouGov

Specs: 2024 McLaren Artura Spider

Engine: 3.0-litre twin-turbo V6 and electric motor
Max power: 700hp at 7,500rpm
Max torque: 720Nm at 2,250rpm
Transmission: Eight-speed dual-clutch auto
0-100km/h: 3.0sec
Top speed: 330kph
Price: From Dh1.14 million ($311,000)
On sale: Now

Five expert hiking tips
  • Always check the weather forecast before setting off
  • Make sure you have plenty of water
  • Set off early to avoid sudden weather changes in the afternoon
  • Wear appropriate clothing and footwear
  • Take your litter home with you
Company Profile

Name: Direct Debit System
Started: Sept 2017
Based: UAE with a subsidiary in the UK
Industry: FinTech
Funding: Undisclosed
Investors: Elaine Jones
Number of employees: 8

EMIRATES'S REVISED A350 DEPLOYMENT SCHEDULE

Edinburgh: November 4 (unchanged)

Bahrain: November 15 (from September 15); second daily service from January 1

Kuwait: November 15 (from September 16)

Mumbai: January 1 (from October 27)

Ahmedabad: January 1 (from October 27)

Colombo: January 2 (from January 1)

Muscat: March 1 (from December 1)

Lyon: March 1 (from December 1)

Bologna: March 1 (from December 1)

Source: Emirates

The specs: 2019 Infiniti QX50

Price, base: Dh138,000 (estimate)
Engine: 2.0L, turbocharged, in-line four-cylinder
Transmission: Continuously variable transmission
Power: 268hp @ 5,600rpm
Torque: 380Nm @ 4,400rpm
Fuel economy: 6.7L / 100km (estimate)

COMPANY PROFILE

Company name: Klipit

Started: 2022

Founders: Venkat Reddy, Mohammed Al Bulooki, Bilal Merchant, Asif Ahmed, Ovais Merchant

Based: Dubai, UAE

Industry: Digital receipts, finance, blockchain

Funding: $4 million

Investors: Privately/self-funded

Bournemouth 0

Manchester United 2
Smalling (28'), Lukaku (70')

SPEC SHEET: NOTHING PHONE (2A)

Display: 6.7-inch flexible Amoled, 2,412 x 1,080, 394ppi, 120Hz, Corning Gorilla Glass 5

Processor: MediaTek Dimensity 7,200 Pro, 4nm, octa-core

Memory: 8/12GB

Capacity: 128/256GB

Platform: Android 14, Nothing OS 2.5

Main camera: Dual 50MP main, f/1.88 + 50MP ultra-wide, f/2.2; OIS, EIS, auto-focus, ultra XDR, night mode

Main camera video: 4K @ 30fps, full-HD @ 60fps; slo-mo full-HD at 120fps

Front camera: 32MP wide, f/2.2

Battery: 5,000mAh; 50% in 30 minutes with 45-watt charger

Connectivity: Wi-Fi, Bluetooth 5.3, NFC (Google Pay)

Biometrics: Fingerprint, face unlock

I/O: USB-C

Durability: IP54, limited protection from water/dust

Cards: Dual-nano SIM

Colours: Black, milk, white

In the box: Nothing Phone (2a), USB-C-to-USB-C cable, pre-applied screen protector, Sim tray ejector tool

Price (UAE): Dh1,199 (8GB/128GB) / Dh1,399 (12GB/256GB)

MEDIEVIL (1998)

Developer: SCE Studio Cambridge
Publisher: Sony Computer Entertainment
Console: PlayStation, PlayStation 4 and 5
Rating: 3.5/5

Company profile

Name: Tabby
Founded: August 2019; platform went live in February 2020
Founder/CEO: Hosam Arab, co-founder: Daniil Barkalov
Based: Dubai, UAE
Sector: Payments
Size: 40-50 employees
Stage: Series A
Investors: Arbor Ventures, Mubadala Capital, Wamda Capital, STV, Raed Ventures, Global Founders Capital, JIMCO, Global Ventures, Venture Souq, Outliers VC, MSA Capital, HOF and AB Accelerator.

Fifa World Cup Qatar 2022

First match: November 20
Final 16 round: December 3 to 6
Quarter-finals: December 9 and 10
Semi-finals: December 13 and 14
Final: December 18


Latest
Most Read
Top Videos

View from DC

The inside scoop from The National’s Washington bureau

      By signing up, I agree to The National's privacy policy
      View from DC