Misplaced Pages

Boston Library Consortium

Article snapshot taken from Wikipedia with creative commons attribution-sharealike license. Give it a read and then ask your questions in the chat. We can research this topic together.

The Boston Library Consortium ( BLC ) is a library consortium based in the Boston area with 26 member institutions across New England .

#670329

61-714: The Boston Library Consortium is a consortium of twenty-six institutions: sixteen in Massachusetts, five in Connecticut, one in New Hampshire, one in Rhode Island, and two in Vermont. The Internet Archive is an affiliate member. Member institutions represent a mixture of liberal arts colleges , research universities , public and private higher education institutions, special libraries , and public libraries . New members may join

122-825: A WARC file . A primary and back-up copy is stored at the Internet Archive data centers. A copy of the WARC file can be given to subscribing partner institutions for geo-redundant preservation and storage purposes to their best practice standards. Periodically, the data captured through Archive-It is indexed into the Internet Archive's general archive. As of March 2014 , Archive-It had more than 275 partner institutions in 46 U.S. states and 16 countries that have captured more than 7.4 billion URLs for more than 2,444 public collections. Archive-It partners are universities and college libraries, state archives, federal institutions, museums, law libraries, and cultural organizations, including

183-558: A backup archive in a foreign country was because of the upcoming presidency of Donald Trump . Beginning in 2017, OCLC and the Internet Archive have collaborated to make the Archive's records of digitized books available in WorldCat . Since 2018, the Internet Archive visual arts residency, which is organized by Amir Saber Esfahani and Andrew McClintock, helps connect artists with the Archive's over 48 petabytes of digitized materials. Over

244-465: A comparison with the 2023 British Library cyberattack , which affected the UK Web Archive . Beginning October 9, 2024, the Internet Archive's team, including archivist Jason Scott and security researcher Scott Helme, confirmed DDoS attacks, site defacement, and a data breach. The purported hacktivist group SN_BLACKMETA again claimed responsibility. A pop-up on the defaced site claimed that there

305-550: A database. As of September 5, 2024 , the Internet Archive held over 866 billion web pages, more than 42.5 million print materials, 13 million videos, 3 million TV news, 1.2 million software programs, 14 million audio files, 5 million images, and 272,660 concerts in its Wayback Machine. Created in early 2006, Archive-It is a web archiving subscription service that allows institutions and individuals to build and preserve collections of digital content and create digital archives. Archive-It allows

366-550: A description and tags which make them more searchable. Some file types can be previewed directly on the site, where as others have to be downloaded in order to be opened. If multiple multimedia files exist in an item, the website generates a playlist for video or audio files, or a slide show for pictures. If an item contains at least one video or picture, the Archive generates a preview thumbnail that can be seen on collection pages and in searches. Items can contain mixed data such as music files with an album cover picture, in which case

427-426: A donation of 250,000 books from Trent University in 2018, and the entire collection of Marygrove College 's library after it closed in 2020. All material is then digitized and retained in digital storage, while a digital copy is returned to the original holder and the Internet Archive's copy, if not in the public domain, is lent to patrons worldwide one at a time under the controlled digital lending (CDL) theory of

488-554: A reciprocal borrowing agreement through which faculty and other patrons affiliated with any member library can borrow materials for free from other member libraries, a shared virtual catalog and rapid delivery of materials between libraries to fulfill patron requests, cooperative purchasing of scholarly resources, and hosting of communities of interest to foster discussion and collaboration among member libraries. Past activities included cooperative collecting and sharing of materials in select subject areas, such as women's studies. In 2007,

549-491: A reusable and sustainable solution enabling controlled digital lending as a component of libraries’ interlibrary loan services." In October 2023, the BLC became the new fiscal sponsor of Project ReShare, a community of libraries, consortia, information organizations, and developers that has been developing and implementing open-source software for library resource sharing since 2018. Internet Archive The Internet Archive

610-401: A two-week loan of e-books in its controlled digital lending program for over 647,784 books not in the public domain, in partnership with over 1,000 library partners from six countries after a free registration on the web site. Open Library is a free and open-source software project, with its source code freely available on GitHub . The Open Library faces objections from some authors and

671-742: Is "preserving the print scholarly record and ensuring its availability for scholars, students and faculty." Under BLC auspices, the EAST initiative received startup grants totaling $ 1.5 million from the Andrew W. Mellon Foundation and the Davis Educational Foundation in 2014–2015. The BLC provided EAST with staffing and administrative support, technical infrastructure, financial services, and general oversight. By 2018, EAST had become self-supporting through institutional membership fees. In June 2023, EAST became an independent nonprofit organization. Since 2021,

SECTION 10

#1732848145671

732-575: Is a 501(c)(3) nonprofit operating in the United States. In 2019, it had an annual budget of $ 37 million, derived from revenue from its Web crawling services, various partnerships, grants, donations, and the Kahle-Austin Foundation . The Internet Archive also manages periodic funding campaigns. For instance, a December 2019 campaign had a goal of reaching $ 6 million in donations. It uses Ubuntu as its choice of operating system for

793-479: Is a program of the Institute of Museum and Library Services . This program supports undertakings that tackle substantial challenges and opportunities facing the field of libraries and archives . The grant is awarded to projects that support new tools, research, and services that will be widely adopted or replicated within the field: this focus is meant to extend the benefits of federal investment. Projects that develop

854-443: Is an American non-profit organization founded in 1996 by Brewster Kahle that runs a digital library website, archive.org. It provides free access to collections of digitized media including websites , software applications , music , audiovisual , and print materials. The Archive also advocates a free and open Internet . Its mission is committing to provide "universal access to all knowledge". The Internet Archive allows

915-498: Is another project of the Internet Archive. The project seeks to include a web page for every book ever published: it holds 25 million catalog records of editions. It also seeks to be a web-accessible public library: it contains the full texts of approximately 1,600,000 public domain books (out of the more than five million from the main texts collection ), as well as in-print and in-copyright books, many of which are fully readable, downloadable and full-text searchable ; it offers

976-518: Is back to normal: 1,500 requests per second". On October 20, threat actors stole unrotated API tokens and breached Internet Archive on its Zendesk email support platform; they also claimed responsibility for the other breaches yet stated that SN_BLACKMETA was behind just the DDoS attacks. On October 21, Internet Archive went back online in a read-only manner. On October 22, all Internet Archive services temporarily went offline, but later that same day, only

1037-492: Is to help preserve those artifacts and create an Internet library for researchers, historians, and scholars. In August 2012, the Archive announced that it had added BitTorrent to its file download options for more than 1.3 million existing files, and all newly uploaded files. This method is the fastest means of downloading media from the Archive, as files are served from two Archive data centers, in addition to other torrent clients which have downloaded and continue to serve

1098-541: The ARChive of Contemporary Music . A project to preserve recordings of amateur radio transmissions, with funding from the Amateur Radio Digital Communications foundation. The Live Music Archive sub-collection includes more than 170,000 concert recordings from independent musicians, as well as more established artists and musical ensembles with permissive rules about recording their concerts, such as

1159-551: The Arcadia Fund . A year later, the Internet Archive received further funding from the Arcadia Fund to invite some other university presses to partner with the Internet Archive to digitize books, a project called "Unlocking University Press Books". The Library of Congress created numerous Handle System identifiers that pointed to free digitized books in the Internet Archive. The Internet Archive and Open Library are listed on

1220-606: The Electronic Literature Organization , North Carolina State Archives and Library, Stanford University , Columbia University , American University in Cairo , Georgetown Law Library, and many others. In September 2020, Internet Archive announced a new initiative to archive and preserve open access academic journals, called Internet Archive Scholar . Its full-text search index includes over 25 million research articles and other scholarly documents preserved in

1281-455: The Google Cache yet. During the week of May 27, 2024, the Internet Archive suffered a series of distributed denial of service (DDoS) attacks that made its services unavailable intermittently, sometimes for hours at a time, over a period of several days. The attack was claimed on May 28 by a hacker group called SN_BLACKMETA , with possible links to Anonymous Sudan . The incident drew

SECTION 20

#1732848145671

1342-571: The Grateful Dead , and more recently, The Smashing Pumpkins . Also, Jordan Zevon has allowed the Internet Archive to host a definitive collection of his father Warren Zevon 's concert recordings. The Zevon collection ranges from 1976 to 2001 and contains 126 concerts including 1,137 songs. The Great 78 Project aims to digitize 250,000 78 rpm singles (500,000 songs) from the period between 1880 and 1960, donated by various collectors and institutions. It has been developed in collaboration with

1403-630: The Society of Authors , who hold that the project is distributing books without authorization and is thus in violation of copyright laws, and four major publishers initiated a copyright infringement lawsuit against the Internet Archive in June 2020 to stop the Open Library project. Many large institutional sponsors have helped the Internet Archive provide millions of scanned publications (text items). Some sponsors that have digitized large quantities of texts include

1464-605: The United States Federal Courts ' PACER electronic document system via the RECAP web browser plugin. These documents had been kept behind a federal court paywall. On the Archive, they had been accessed by more than six million people by 2013. The Archive's BookReader web app , built into its website, has features such as single-page, two-page, and thumbnail modes; fullscreen mode; page zooming of high-resolution images; and flip page animation. In October 2024,

1525-608: The first-sale doctrine . On June 1, 2020, four large publishing houses – Hachette Book Group , Penguin Random House , HarperCollins , and John Wiley – filed a lawsuit against the Internet Archive before the United States District Court for the Southern District of New York , claiming that the Internet Archive's practice of controlled digital lending constituted copyright infringement . On March 25, 2023,

1586-444: The public domain . The Archive ensured the items were attributed and linked back to Google, which never complained, while libraries "grumbled". According to Kahle, this is an example of Swartz's "genius" to work on what could give the most to the public good for millions of people. In addition to books, the Archive offers free and anonymous public access to more than four million court opinions, legal briefs, or exhibits uploaded from

1647-557: The Archive began working to provide specialized services relating to the information access needs of the print-disabled; publicly accessible books were made available in a protected Digital Accessible Information System (DAISY) format. According to its website: Most societies place importance on preserving artifacts of their culture and heritage. Without such artifacts, civilization has no memory and no mechanism to learn from its successes and failures. Our culture now produces more and more artifacts in digital form. The Archive's mission

1708-512: The Archive in May 1996, around the same time that he began the for-profit web crawling company Alexa Internet . The earliest known archived page on the site was saved on May 10, 1996, at 2:42 pm UTC (7:42 am PDT ). By October of that year, the Internet Archive had begun to archive and preserve the World Wide Web in large amounts. The archived content became more easily available to

1769-680: The Archive of Contemporary Music and George Blood Audio, responsible for the audio digitization. The Archive has a collection of freely distributable music that is streamed and available for download via its Netlabels service. The music in this collection generally has Creative Commons-license catalogs of virtual record labels. This collection contains more than 3.5 million items. Cover Art Archive , Metropolitan Museum of Art – Gallery Images, NASA Images, Occupy Wall Street Flickr Archive, and USGS Maps are some sub-collections of Image collection. National Leadership Grants for Libraries The National Leadership Grants for Libraries

1830-513: The Archive's collection; the books are identical to the copies found on Google, except without the Google watermarks, and are available for unrestricted use and download. Brewster Kahle revealed in 2013 that this archival effort was coordinated by Aaron Swartz , who, with a "bunch of friends", downloaded the public domain books from Google slowly enough and from enough computers to stay within Google's restrictions. They did this to ensure public access to

1891-442: The BLC has pursued a program for controlled digital lending . In 2022, the Davis Educational Foundation awarded the BLC a grant of $ 215,000 to "accelerate the implementation of controlled digital lending as a mechanism for interlibrary loan." In 2023, the Institute of Museum and Library Services (IMLS), a U.S. federal government agency, awarded the BLC a National Leadership Grant for Libraries totaling $ 249,221 to "support and scale

Boston Library Consortium - Misplaced Pages Continue

1952-466: The BLC if they are based in the northeastern United States and if their application is approved by a two-thirds vote of the board of directors. The BLC is funded through membership dues. The BLC is administered by an Executive Director and governed by a board of directors. Each member institution of the BLC is represented on the Board by the chief librarian of its principal library. According to its bylaws,

2013-524: The BLC partnered with the Open Content Alliance (OCA) to digitize BLC member libraries' out-of-copyright print collections and make them freely available online through the Internet Archive . To fund this initiative, the BLC and its members pledged more than $ 845,000 over two years. This partnership made the BLC the first large-scale consortium to embark on a self-funded digitization project with

2074-497: The Faxon Company to launch development of a union list of serials in 1982. BLC executive directors have included Hannah Stevens as of 2000, Barbara Preece as of 2002, Melissa Trevvett as of 2010, Susan Stearns from 2014 to 2020, and Charlie Barlow since 2020. Major BLC areas of activity include resource sharing and professional development. The BLC runs a "BLC Leads" program to foster leadership development among member library staff,

2135-471: The Internet Archive before the same United States District Court for the Southern District of New York over the Internet Archive's Great 78 Project for $ 621 million in damages from alleged copyright infringement. In September 2024, Google and the Internet Archive signed a partnership to allow people to see previous versions of websites on Google Search that uses the Wayback Machine, without linking

2196-614: The Internet Archive struck a deal with the Leiden University Library to accept the paper copies of 400,000 uncatalogued foreign dissertations held at the Library that were to be pulped – with a view to digitising them and making them accessible online. The collection includes theses by Niels Bohr , Marie Curie , Émile Durkheim , Albert Einstein , Otto Hahn , Carl Jung , J. Robert Oppenheimer , Max Planck , Luigi Pirandello , Gustav Stresemann and Max Weber . The Open Library

2257-473: The Internet Archive was operating 33 scanning centers in five countries, digitizing about 1,000 books a day for a total of more than 2 million books, in a total collection of 4.4 million books – including material digitized by others and fed into the Internet Archive; at that time, users were performing more than 15 million downloads per month. The material digitized by others includes more than 300,000 books that were contributed to

2318-664: The Internet Archive. The collection spans from digitized copies of eighteenth century journals through the latest open access conference proceedings and pre-prints crawled from the World Wide Web. In 2021, the Internet Archive announced the initial version of the General Index , a publicly available index to a collection of 107 million academic journal articles . The Archive stores files inside so-called items, which are similar to directories in that they can contain multiple files, but can have additional metadata such as

2379-476: The Library of Congress website as a source of e-books. In addition to web archives, the Internet Archive maintains extensive collections of digital media that are attested by the uploader to be in the public domain in the United States or licensed under a license that allows redistribution, such as Creative Commons licenses. Media are organized into collections by media type (moving images, audio, text, etc.), and into sub-collections by various criteria. Each of

2440-692: The OCA. In 2014, the BLC, along with the Orbis Cascade Alliance and other groups, advocated against a publisher price increase on e-books , which they feared would negatively impact academic library budgets. From 2014 to 2023, the BLC administered the Eastern Academic Scholars' Trust (EAST), a collective collection initiative across more than one hundred academic libraries throughout the eastern United States. EAST member libraries have committed to retaining over six million volumes. EAST's purpose

2501-507: The University of Toronto's Robarts Library , University of Alberta Libraries , University of Ottawa , Library of Congress , Boston Library Consortium member libraries, Boston Public Library , Princeton Theological Seminary Library , and many others. In 2017, the MIT Press authorized the Internet Archive to digitize and lend books from the press's backlist , with financial support from

Boston Library Consortium - Misplaced Pages Continue

2562-404: The Wayback Machine, Archive-It, and blog.archive.org were resumed. On October 23, archive.org, the Wayback Machine, Archive-It, and the Open Library services all resumed but with some features, such as logging in, still unavailable until the staff announced it back available in the next day or two. On October 25, the login feature is now back available for now and the site is active. The Archive

2623-411: The World Wide Web to be searched and accessed. It can be used to see what previous versions of web sites used to look like or to visit web sites that no longer even exist. The Wayback Machine was created as a joint effort between Alexa Internet (owned by Amazon.com ) and the Internet Archive. Hundreds of billions of web sites and their associated data (images, source code, documents, etc.) are saved in

2684-580: The collection, between about 2006 and 2008, by Microsoft through its Live Search Books project, which also included financial support and scanning equipment directly donated to the Internet Archive. On May 23, 2008, Microsoft announced it would be ending its Live Book Search project and would no longer be scanning books, donating its remaining scanning equipment to its former partners. Around October 2007, Archive users began uploading public domain books from Google Book Search . As of November 2013 , there were more than 900,000 Google-digitized books in

2745-546: The consortium's purpose is "to share human and information resources so that the collective strengths support and advance the research and learning of the members’ constituents." Current members include the following institutions: The BLC was founded in 1970 and officially incorporated in 1977, consisting originally of five institutions. It had grown to twelve institutions by 1993, seventeen by 2014, and nineteen by 2019. Former members include Brown University and Massachusetts Institute of Technology (MIT). The BLC partnered with

2806-485: The course of the yearlong residency, visual artists create a body of work which culminates in an exhibition. The hope is to connect digital history with the arts and create something for future generations to appreciate online or off. Previous artists in residence include Taravat Talepasand , Whitney Lynn , and Jenny Odell . The Internet Archive acquires most materials from donations, such as hundreds of thousands of 78 rpm discs from Boston Public Library in 2017,

2867-408: The court found in favor of the publishers. The negotiated judgment of August 11, 2023, barred the Internet Archive from digitally lending books for which electronic copies are on sale. Also on August 11, 2023, the music industry giants Universal Music Group , Sony Music and Concord (together with their respective labels Capitol Records , Arista Records and CMGI Recorded Music Assets) sued

2928-428: The establishment of partnerships. National Forum Grants support group efforts, such as collaborations or partnerships . These efforts typically assemble subject matter experts (SMEs) and other significant participants, including contributors from related fields, to discuss matters of current significance to libraries and archives on a nation-wide scale. Project Grants provide funding for projects that have passed

2989-557: The estimated $ 600,000 in damage. An overhaul of the site was launched as beta in November 2014, and the legacy layout was removed in March 2016. In November 2016, Kahle announced that the Internet Archive was building the Internet Archive of Canada, a copy of the Archive to be based somewhere in Canada . The announcement received widespread coverage due to the implication that the decision to build

3050-566: The files. On November 6, 2013, the Internet Archive's headquarters in San Francisco's Richmond District caught fire, destroying equipment and damaging some nearby apartments. According to the Archive, it lost a side-building housing one of 30 of its scanning centers; cameras, lights, and scanning equipment worth hundreds of thousands of dollars; and "maybe 20 boxes of books and film, some irreplaceable, most already digitized, and some replaceable". The nonprofit Archive sought donations to cover

3111-637: The general public in 2001, through the Wayback Machine . In late 1999, the Archive expanded its collections beyond the web archive, beginning with the Prelinger Archives . Now, the Internet Archive includes texts, audio, moving images, and software . It hosts a number of other projects: the NASA Images Archive, the contract crawling service Archive-It, and the wiki-editable library catalog and book information site Open Library . Soon after that,

SECTION 50

#1732848145671

3172-552: The main collections includes a "Community" sub-collection (formerly named "Open Source") where general contributions by the public are stored. The Audio Archive includes music, audiobooks , news broadcasts, old time radio shows, podcasts , and a wide variety of other audio files. As of January 2023 , there are more than 15,000,000 free digital recordings in the collection. The subcollections include audio books and poetry, podcasts, non-English audio, and many others. The sound collections are curated by B. George , director of

3233-530: The picture is used as thumbnail. Staff members of the Internet Archive organize items by placing them into so-called collections, which are pages listing multiple items. The scanning performed by the Internet Archive is financially supported by libraries and foundations. As of November 2008 , when there were approximately 1 million texts, the entire collection was greater than 500 terabytes, which included raw camera images, cropped and skewed images, PDFs , and raw OCR data. As of July 2013 ,

3294-441: The preservation and management of digital library collections. Sparks Grants are relatively small awards for swift development and evaluation of improvements in library services resulting from innovation in tools, products, or organizational practices. Planning Grants provide the funding of initial organizational activities. These include actions such as needs analysis, feasibility studies , prototyping and pilot programs , or

3355-458: The public to upload and download digital material to its data cluster, but the bulk of its data is collected automatically by its web crawlers , which work to preserve as much of the public web as possible. Its web archive , the Wayback Machine , contains hundreds of billions of web captures. The Archive also oversees numerous book digitization projects , collectively one of the world's largest book digitization efforts. Brewster Kahle founded

3416-664: The risk of data loss, the Archive creates copies of parts of its collection at more distant locations, including the Bibliotheca Alexandrina in Egypt and a facility in Amsterdam . The Archive is a member of the International Internet Preservation Consortium and was officially designated as a library by the state of California in 2007. The Wayback Machine is a service that allows archives of

3477-446: The role of libraries as community anchors and encourage civic and cultural engagement, facilitate lifelong learning , promote digital inclusion, and support economic vitality through programming and services. Programs that develop open source software applications used by libraries and archives across the United States to provide digital content and services. Initiatives that have a noteworthy national influence on shared services for

3538-418: The user to customize their capture or exclusion of web content they want to preserve for cultural heritage reasons. Through a web application, Archive-It partners can harvest, catalog, manage, browse, search, and view their archived collections. In terms of accessibility, the archived web sites are full text searchable within seven days of capture. Content collected through Archive-It is captured and stored as

3599-672: The website servers. The Archive is headquartered in San Francisco , California. From 1996 to 2009, its headquarters were in the Presidio of San Francisco , a former U.S. military base. Since 2009, its headquarters have been at 300 Funston Avenue in San Francisco, a former Christian Science Church . At one time, most of its staff worked in its book-scanning centers; as of 2019, scanning is performed by 100 paid operators worldwide. The Archive also has data centers in three Californian cities: San Francisco, Redwood City , and Richmond . To reduce

3660-477: The website was still mostly offline for "prioritizing keeping data safe at the expense of service availability." On October 11, Kahle said that the data is safe, and will bring the service back to normal "in days, not weeks." On October 13, the Wayback Machine was restored in a read-only format, while archiving web pages was temporarily disabled. On October 14, Brewster Kahle said "[the Wayback Machine] volume

3721-481: Was a "catastrophic" security breach , stating "Have you ever felt like the Internet Archive runs on sticks and is constantly on the verge of suffering a catastrophic security breach? It just happened. See 31 million of you on HIBP !" It was reported that about 31 million user accounts were affected, and compromised in a file called "ia_users.sql", dated September 28, 2024. The attackers stole users' email addresses and Bcrypt -hashed passwords. As of October 15, 2024,

SECTION 60

#1732848145671
#670329