Interested in computational text analysis, but don’t have coding experience? Or perhaps you’ve already written your own Python scripts, but you’re on the lookout for sources to build your text corpus. The Gale Digital Scholar Lab, new to the Library, offers solutions for digital humanities and digital scholarship researchers regardless of your level of technical expertise.
Create Visualizations and Run Computational Analyses in Your Web Browser
The Gale Digital Scholar Lab offers six analysis tools through which you can analyze Gale materials with just a few clicks:
- “Clustering” analyzes similar words across documents.
- “Named entity recognition” extracts proper and common nouns and groups them by types such as people, organizations, or dates.
- “Ngram” looks at the frequency of various terms or phrases.
- “Parts of speech tagger” considers how authors’ use of speech varies over time.
- “Sentiment analysis” tallies the positive or negative words in each document to produce a sentiment value.
- “Topic modeling” collects terms that frequently co-occur across a group of documents.
Download Plain-Text Files to Run Your Own Analyses
You can download up to 1000 documents at a time as plain-text files for your personal use. You can run your own analyses on this data and combine it with other text sources to build custom text corpora.
What Content Is Available?
The Gale Digital Scholar Lab includes 160 million pages of Gale Primary Sources content from the following primary source digital archives:
17th and 18th Century Burney Collection
American Civil Liberties Union Papers, 1912-1990
Archives of Sexuality & Gender
British Library Newspapers
The Economist Historical Archive
Eighteenth Century Collections Online
Indigenous Peoples: North America
The Making of Modern Law: Foreign Primary Sources
The Making of Modern Law: Foreign, Comparative, and International Law, 1600-1926
The Making of Modern Law: Legal Treatises, 1800-1926
The Making of Modern Law: Primary Sources
The Making of Modern Law: Trials, 1600-1926
The Making of the Modern World
Nineteenth Century Collections Online
Nineteenth Century U.S. Newspapers
Sabin Americana, 1500-1926
The Sunday Times Digital Archive
The Times Digital Archive
The Times Literary Supplement Historical Archive
U.S. Declassified Documents Online
- View scans of original documents side-by-side with OCR plain text
- Work iteratively with your content set to refine your results
- Easily clean your data right in the Gale Digital Scholar Lab interface and create custom text-cleaning templates
- Work with materials and tools in other languages
How to Get Started
- Visit the Gale Digital Scholar Lab
- Log in with your Google or Microsoft OneDrive credentials (a personal account is needed so you can create and save personalized datasets)
- Create your dataset by searching through the materials in the Lab.
- Run analyses on your dataset right in the web browser and get immediate results, or download your dataset to your computer to run your own scripts.
Toni Morrison, Nobel Laureate and giant of American letters, passed away yesterday, and her words will always be with us. Find her books at the Berkeley Libraries as well as scholarship on her oeuvre.
What books of Toni Morrison’s made an impression on you? Tell us on Twitter.
If you use Literature Online (LION) or the MLA International Bibliography, you may have noticed some changes recently.
The MLA International Bibliography and the MLA Directory of Periodicals will now be found solely on the EBSCO platform. Although the interface looks different, the functionality has not changed. The Bibliography indexes journal articles and other critical scholarship in literature, languages, linguistics, and folklore.
New Subcollections on Trial from Drama Online
The Library has a trial to new subcollections through our subscription to Drama Online. The trial will run through October 26th, 2018. The Library wants to hear from you! Please send your comments and feedback to sreardon at berkeley.edu.
Publish Digital Books & Open Educational Resources with Pressbooks
Wednesday, September 26, 11:10am-12:30pm
Academic Innovation Studio, Dwinelle Hall 117 (Level D)
If you’re looking to self-publish work of any length and want an easy-to-use tool that offers a high degree of customization, allows flexibility with publishing formats (EPUB, MOBI, PDF), and provides web-hosting options, Pressbooks may be great for you. Pressbooks is often the tool of choice for academics creating digital books, open textbooks, and open educational resources, since you can license your materials for reuse however you desire. Learn why and how to use Pressbooks for publishing your original books or course materials. You’ll leave the workshop with a project already under way! Register at bit.ly/dp-berk
Upcoming Workshops in this Series 2018-2019:
- The Long Haul: Best Practices for Making Your Digital Project Last
- HTML/CSS Toolkit for Digital Projects
Please see bit.ly/dp-berk for details.
Early European Books Online (EEB) is a collection of digitized European books printed in the early modern period (1450s-1700). With strong representation in Danish, Dutch, French, Italian, and Latin among many other languages, this collection will be of interest to scholars in literature, philosophy, history, and religion. Works include those by Tycho Brahe, Michelangelo Buonarrotie the Younger (nephew of the painter Michelangelo), Nostradamus, Blaise Pascal, Rene Descartes, John Calvin, and many more.
The collection is drawn from the Danish Royal Library, the National Central Library in Florence, the National Library of France, the National Library of the Netherlands, the Wellcome Library in London, and others. It complements Berkeley’s access to Early English Books Online.
Search by country of publication, language, page features (illustration, musical notation), and source library. You may include historical and linguistic variants in your search. Books can be browsed in an online Flash-based viewer or downloaded as JPEGs or PDFs. Scans are of the entire physical object and pages, including marginalia and binding. Early European Books is moving to a new platform this year, so look forward to improved speed and usability.
New Subcollections on Trial from Drama Online
The Library has a trial to new subcollections through our subscription to Drama Online. The trial will run through September 20th, 2018. The Library wants to hear from you! Please send your comments and feedback to sreardon at berkeley.edu.
Shakespeare’s Globe on Screen (2016-2018):
Shakespeare’s Globe on Screen 2 (2016-2018) is composed of streaming productions of The Duchess of Malfi, Measure for Measure, The Merchant of Venice, and Richard II from the theatre’s 2016, 2017 and 2018 seasons. This subcollection builds on our existing access to previous productions from the Globe.
The Royal Shakespeare Company Live Collection:
Recordings of live screenings of 12 productions from the Royal Shakespeare Company in Stratford-upon-Avon, including Hamlet, The Tempest, and King Lear.
Shakespeare’s Heroes & Villains:
A 1.5 hour streaming production featuring actor Steven Berkoff’s mediations on and performances of excerpts from Shakespeare’s heroes and villains.
Critical Studies & Performance Practice:
330 academic books from the Methuen Drama and The Arden Shakespeare imprints of Bloomsbury Academic, as well as over 60 actor training videos.
Aurora Metro Books:
Independent publisher with over 120 drama plays, including works from Robin Soans, Manjula Padmanabhan and Germaine Greer, as well as non-fiction books on theatre.
Playwrights Canada Press:
New plays, theatre history, criticism, and biography in Canadian theater, including French plays by Canadian authors in English translation.
About Drama Online
The UC Berkeley community already has access to much of Drama Online. Drama Online (Bloomsbury) is an online resource of primary and secondary sources for the study and performance of drama. It contains 1700 playtexts, 350 audio performances, and 150 hours of video. The platform has a web-based e-reader with page and line numbers that correspond to the print edition, download options, full text search, visualization tools including a Words and Speeches Graph and a Character Grid for each play, the ability to view lines for one specific character, genre, period, title, or playwright browsing, and annotation tools.
Content UC Berkeley already has access to:
- 1,100+ playtexts from Methuen Drama, Faber and Faber, and Arden Shakespeare, as well as contextual and critical background through scholarly works and practical guides.
- Productions from Shakespeare’s Globe On Screen
- Early modern drama titles staged and filmed specifically for educational use (Doctor Faustus, The Duchess of Malfi, Volpone and School for Scandal).
- L.A. Theatre Works audio collection
- Playscripts from theatre publisher Nick Hern
- BBC Drama Films
The celebrated American writer Philip Roth has passed away at 85 years old. Join us at the Library in re-reading old and new favorites from his incredibly prolific 53 year oeuvre.
The DH Faire is an annual Berkeley event that brings together digital humanists to share their work and reflect on the field more broadly. Including lectures, a poster session, and an evening reception, the Faire offers the Berkeley community the opportunity to share projects at various stages of development and receive invaluable feedback from peers.
Join us for the 2018 DH Faire, kicking off Monday, April 2nd!
DH Faire Reception and Poster Session
Mon. April 2, 5:00 – 7:00 PM | Morrison Library, 101 Doe
Help us kick off the 2018 DH Faire by enjoying refreshments with colleagues and browsing posters on recent DH work in Berkeley’s Morrison Library.
Digital Scholarship in the Library and Archives
Wed. April 4, 9:30 – 11:00 AM | Doe Library 180
The Library collaborates with digital humanities research and initiatives through digital publishing, digitization, preservation strategies, metadata creation, subject expertise, instruction, and much more. Come hear from librarians and archivists Mary Elings, Cody Hennesy, and Josh Quan on the ways they have engaged with digital scholarship both within the Library and on campus.
Keynote: David Bamman: The Long Rise of Word Vectors in the Digital Humanities
Wed. April 4, 4:00 -5:30 PM | Academic Innovation Studio (117 Dwinelle)
Professor David Bamman will outline the history of distributed representations of words and unpack what’s new about contemporary (neural) methods of learning such representations compared to previous approaches. By focusing on the fundamentals of representation learning, he will also discuss how we can incorporate other forms of extra-linguistic information into the representation for a word (such as time, geographical location, or author identity) and use that more complex representation for linguistic reasoning.
UC ClioMetric History Project
Zach Bleemer, Project Director
Thursday, March 8
D-Lab (Barrows 371)
2018 is the sesquicentennial of the University of California. To celebrate this milestone, UC Berkeley’s Center for Studies in Higher Education has partnered with the UC Office of the President to create the UC ClioMetric History Project (uccliometric.org), a massive data repository documenting the university system’s history. The project encompasses interactive displays visualizing 150 years of UC’s students, faculty, courses, and budgets as well as research-ready collections of data now available for use.
This presentation is part of the DH Forum series sponsored by the DH Working Group. Refreshments will be served.