Skip to main content

More right-solid
More right-solid
More right-solid
More right-solid
SHOW DETAILS
up-solid down-solid
eye
Title
Date Favorited
Creator
Microsoft Research Audio
collection
1,709
ITEMS
6,092
VIEWS
by Microsoft
collection
eye 6,092
More than 1,100 brilliant scientists and engineers push the boundaries of computing in multiple research areas in 13 research labs around the world. Discover what we've delivered to Microsoft and to the world, such as contributions to Kinect for Xbox 360, work to develop an HIV vaccine, and advancing education techniques in rural communities.
Topics: Microsoft, Research, Podcast
Modern software engineering tools exhibit a fundamental paradox: they are meant to support the collaborative activity of software development, but cause individuals and groups to work independently from one another. The underlying issue is that existing tools discretize time and tasks in concrete but isolated process steps. This approach is fundamentally flawed in assuming that human activity can be codified and that periodic resynchronization of tasks is an easy step. In this talk, I present a...
Topics: Microsoft Research, Microsoft Research Audio MP3 Archive, Rob Deline, Anita Sarma
-
search
favorite 0
-
search
favorite 0
Community Audio
by George Friedman
audio
eye 181
favorite 1
comment 0
Speech at Stratfor conference by George Friedman on conflict potential in Europe, US hegemony and suggested purpose to prevent cooperation between Russia and Germany
Topics: Stratfor, think tank, George Friedman, conflict, war, politics, Europe, Hegemony, cooperation,...
Community Data
texts
eye 137
favorite 1
comment 0
Podesta emails in .eml format with full metadata and attachments. Emails numbered from  9078  through 11107 (the seventh and eighth batches of leaked emails) compiled Oct 15, 2016 File names use the wikileaks email numbers not the original file names.  Also included is the multi threaded python script used to download the emails, it can easily be adapted by any novice (skilled enough to get a hello world to run) to quickly download future batches with just seconds of editing.  Our...
Topic: Wikileaks Podesta Clinton Email
-
search
favorite 0
ArchiveBot: The Archive Team Crowdsourced Crawler
collection
21,070
ITEMS
1.2B
VIEWS
collection
eye 1.2B
ArchiveBot is an IRC bot designed to automate the archival of smaller websites (e.g. up to a few hundred thousand URLs). You give it a URL to start at, and it grabs all content under that URL, records it in a WARC, and then uploads that WARC to ArchiveTeam servers for eventual injection into the Internet Archive (or other archive sites). To use ArchiveBot, drop by #archivebot on EFNet. To interact with ArchiveBot, you issue commands by typing it into the channel. Note you will need channel...
Topics: archiveteam, archivebot, webcrawl, robot, love
Archive Team: The Xanga Conga
collection
455
ITEMS
19.3M
VIEWS
collection
eye 19.3M
Xanga /ˈzæŋɡə/ is a website that hosts weblogs, photoblogs, and social networking profiles. It is operated by Xanga.com, Inc., based in New York City. In September of 2013 Xanga relaunched under the assumed name of Xanga 2.0. Xanga/Xanga 2.0 is no longer a free blogging webspace. Users will now have to pay an annual fee of $48.00. The intellectual property of many users has since been lost. Xanga only saved archives from users that posted in the last five years (2008-2013). On their...
Topics: Xanga, Doomed, Archive Team
The Archive Team Just In Time Grabs
collection
61,800
ITEMS
24.6M
VIEWS
collection
eye 24.6M
The hardest part about our transient, shallow world wide web is the terrifying swiftness in which data disappears. To this end, Archive Team members have often bravely strapped on miner's helmets and flashlights, dove into the flaming wreckage of a dying site, and grabbed a copy for all of time. Some of these rescues, consisting of what we could grab, are being saved here. Please Note: Some of these items were not burning as brightly or recently as others - they might be merely considered...
Archive Team: The Pomf Pillow Fight
collection
143
ITEMS
33.5M
VIEWS
collection
eye 33.5M
POMF is a sound effect and onomatopoeia describing the sound someone makes as they fall onto a bed or a similar surface. It is commonly described through the symbolia =3 and combined with the catchphrase “What are we gonna do on the bed?”. On the internet, the term has gained usage in both verbal and image variations and is often used as an exploitable.
Archive Team: The Wretch Wrench
collection
2,165
ITEMS
43.7M
VIEWS
collection
eye 43.7M
Wretch (Chinese: 無名小站; pinyin: wúmíng xiǎo zhàn) is a Taiwanese community web site; in Chinese, its name means Nameless Little Site. It is the most well-known blog community in Taiwan with thousands of users registered. Wretch provides free photo album, and blog hosting services. Four languages, including English, are available. A more extensive VIP version is offered. It is the top visited site in Traditional Chinese languages and the second in Taiwan after Yahoo Taiwan according...
The Archive Team Just In Time Grabs
web
eye 436,528
favorite 1
comment 0
ArchiveBot: The Archive Team Crowdsourced Crawler
by Archive Team
web
eye 515,700
favorite 1
comment 0
ArchiveBot is an Archive Team service to quickly grab smaller at-risk or critical sites to bring copies into the Internet Archive Wayback machine.
Archive Team: The Silenced Yahoo! Voices
collection
30
ITEMS
575,440
VIEWS
collection
eye 575,440
Yahoo! Voices, formerly Associated Content (AC), is a division of Yahoo that focuses on online publishing. Yahoo! Voices distributes a large variety of writing through its website and content partners, including Yahoo! News. In early December, 2011, its owners Yahoo announced a major shakeup involving the introduction of a new service, Yahoo! Voices, which would replace the Associated Content site and take on the bulk of its content, while some 75,000 items would be retired under the new site's...
Archive Team
collection
580,872
ITEMS
2B
VIEWS
collection
eye 2B
Formed in 2009, the Archive Team (not to be confused with the archive.org Archive-It Team) is a rogue archivist collective dedicated to saving copies of rapidly dying or deleted websites for the sake of history and digital heritage. The group is 100% composed of volunteers and interested parties, and has expanded into a large amount of related projects for saving online and digital history. History is littered with hundreds of conflicts over the future of a community, group, location or...
The Archive Team Friendster Snapshot Collection
collection
143
ITEMS
22,361
VIEWS
collection
eye 22,361
Founded in 2002 by Jonathan Abrams and Peter Chin, Friendster was one of the more popular social networking sites, predating later services like Facebook and MySpace. It provided a singular platform to share music, writings, photographs and profiles between a growing amount of users, which grew to roughly 112 million over 9 years. Unlike previous sites like Geocities, Angelfire and Tripod, Friendster allowed larger spaces for upload of data, resulting in exponential growth and a leading...
Cuil Crawl Data
collection
0
ITEMS
22M
VIEWS
collection
eye 22M
Web crawl snapshot generously donated from cuil.com . This collection of pages mostly from 2007 and some from 2008, is about 310 terabytes of compressed data, and almost 60 billion URLs (mostly text). Cuil was a search engine that organized web pages by content and displayed relatively long entries along with thumbnail pictures for many results. Cuil said it had a larger index than any other search engine, with about 120 billion web pages. It went live on July 28, 2008. Cuil's servers were shut...
Ourmedia
by Sicuani Noticias
data
eye 38,047
favorite 1
comment 0
Favicon de Sicuani Noticias
Topics: sicuani, sicuani noticias, burrito, favicon, favisicuani
Community Data
data
eye 36,628
favorite 1
comment 0
feed css
Topic: control feed
Internet Archive Disk Monitoring
collection
427
ITEMS
33,408
VIEWS
by parker@archive.org
collection
eye 33,408
SMART data taken from Internet Archive disk drives.
IRS 990 Forms
collection
924
ITEMS
161,893
VIEWS
collection
eye 161,893
This collection contains images of IRS 990 forms submitted to the IRS by U.S. nonprofit organizations. Annual 990 forms are submitted to the IRS by organizations exempt from U.S. income tax, those seeking to establish that status and some charitable trusts. They disclose certain activities, income, expenditures, assets, liabilities and senior leadership. Items in this collection are organized in two ways: Some items are archived as an"ISO image" file (a disk image) containing the...
Open Library Data
data
eye 0
favorite 0
comment 1
Book MARC Records that were generously donated from the Scriblio project (http://about.scriblio.net) at Plymouth State University. These records are originally from the Library of Congress for their holdings.
( 1 reviews )
Ourmedia
data
eye 2.3M
favorite 3
comment 0
supporting images for the archive website moved into an item by tracey.
The Internet Archive Software Collection
data
eye 97,467
favorite 6
comment 0
The goal is to provide a ubiquitous, flexible, comprehensive-as-possible emulator that will appear in as many browsers as possible without installing a plugin or runtime. While a number of emulation solutions exist that allow much of what is wanted, they nearly all require plugins and most are directed towards a single machine or small sets of machines. Currently, the most flexible runtime is current versions of Javascript, a horribly named runtime that utilizes a Turing-complete programming...
Topics: Emulator, Emulation, Mess
Community Texts
data
eye 258,542
favorite 2
comment 0
The WebBase Archive
collection
842
ITEMS
4.9M
VIEWS
collection
eye 4.9M
Usenet Archive
collection
26,357
ITEMS
1.3M
VIEWS
collection
eye 1.3M
Usenet is a worldwide distributed Internet discussion system. It was developed from the general purpose UUCP dial-up network architecture. Duke University graduate students Tom Truscott and Jim Ellis conceived the idea in 1979 and it was established in 1980. Users read and post messages (called articles or posts, and collectively termed news) to one or more categories, known as newsgroups. Usenet resembles a bulletin board system (BBS) in many respects, and is the precursor to Internet forums...
Arkiver Crawls
collection
464
ITEMS
4.2M
VIEWS
collection
eye 4.2M
Arkiver is a volunteer for downloading and preserving websites from the internet. The downloaded websites are uploaded to the Interet Archive for the Wayback Machine. The crawls are crawls of dying websites, important websites and any other websites worth to be preserved.
Topics: Arkiver, Web, Crawls
Shareware CD-ROMs
software
eye 631
favorite 3
comment 0
Shareware CD-ROMs
software
eye 786
favorite 2
comment 0
Shareware CD-ROMs
software
eye 323
favorite 2
comment 0
Giganews Usenet Collection
collection
25,329
ITEMS
1.2M
VIEWS
collection
eye 1.2M
Giganews , the world’s leading Usenet access provider, and Internet Archive have partnered to preserve text newsgroups, which include discussions between people around the globe on virtually every subject imaginable. Giganews serves consumers, Internet service providers, telecommunications companies, and multi-service operators in more than 196 different countries. As the industry leader in Usenet access service, on any given day tens of millions of Usenet postings are uploaded to Giganews'...
Source: https://github.com/jjjake/giganews
Data Collection
collection
761,892
ITEMS
376.8M
VIEWS
collection
eye 376.8M
Data Collection
by Internet Archive (Jake Johnson, Coder)
data
eye 701
favorite 3
comment 0
This item contains scripts and output from a census done on 2015-03-04 to determine the size of all of the Internet Archive's non-derivative files from public items. metamgr-norm-ids-20150304205357.txt.gz is an itemlist containing all public Archive.org items as of 2015-03-04T20:53:57. This list was used as input for the following command: ./ia-mine-0.5-py3.3.pex metamgr-norm-ids-20150304205357.txt --workers 600 2>/dev/null | pv -lacbrN 'mine' | ./parallel-chunks.sh jq -c -r -f...
Topics: internetarchive.BAK, archive.org census
Community Audio
by Michael Corbin
audio
eye 73
favorite 1
comment 0
The ParaNet Continuum radio show #72 (August 27, 1995) hosted by Michael Corbin and featuring Don Ecker , research director for UFO Magazine . Show length 52:57.
Topics: Don Ecker, ParaNet Continuum, UFO Magazine, UFOs, Unidentified Flying Objects
-
search
favorite 0
-
search
favorite 0
The Archive Team FortuneCity Rescue
image
eye 37
favorite 1
comment 0
A collection of users from the FortuneCity Rescue that were retried and returned variant sizes and information from other attempts.
The FTP Site Boneyard
software
eye 86
favorite 1
comment 0
The FTP Site Boneyard
collection
819
ITEMS
135,612
VIEWS
collection
eye 135,612
This collection contains .tar or .zip files of the collections of these sites, which are then browsable using the Internet Archive's archive view functionality. Created in 1971 (and refined in 1985), the File Transfer Protocol allowed Internet or network-connected computers to transfer binary and ASCII files between each other. To facilitate transferring of files in a pre-WWW era, FTP sites allowing anonymous or open-access connections became available worldwide. As they were often connected to...
The FTP Site Boneyard
software
eye 1,593
favorite 1
comment 0
Full download of the entire contents of FTP.LOTUS.COM, the support FTP site for legacy Lotus products by IBM. Besides the Domino, Notes, 1-2-3 and other projects, a number of shareware and historical documents and programs are included. In recognition of IBM's move away from the Lotus brand after acquisition in the 1990s, this mirror has been generated to ensure historical availability. It is easiest to browse this .zip file using the online browser . Textfiles of the contents have also been...
National Security Internet Archive (NSIA)
collection
2,399,040
ITEMS
42.2M
VIEWS
collection
eye 42.2M
The National Security Internet Archive focuses on files collected from That 1 Archive , MuckRock , NARA, the National Security Archive at GWU, Hood College, the Black Vault , the Government Attic , Paperless Archives, Ernie Lazar, the International Center for 9/11 Studies as well as various other historians, collectors and activists.
Topics: Government, Government documents, FOIA, Freedom of Information Act, National security, Law...