Interested in computational text analysis, but don’t have coding experience? Or perhaps you’ve already written your own Python scripts, but you’re on the lookout for sources to build your text corpus. The Gale Digital Scholar Lab, new to the Library, offers solutions for digital humanities and digital scholarship researchers regardless of your level of technical expertise.
Create Visualizations and Run Computational Analyses in Your Web Browser
The Gale Digital Scholar Lab offers six analysis tools through which you can analyze Gale materials with just a few clicks:
- “Clustering” analyzes similar words across documents.
- “Named entity recognition” extracts proper and common nouns and groups them by types such as people, organizations, or dates.
- “Ngram” looks at the frequency of various terms or phrases.
- “Parts of speech tagger” considers how authors’ use of speech varies over time.
- “Sentiment analysis” tallies the positive or negative words in each document to produce a sentiment value.
- “Topic modeling” collects terms that frequently co-occur across a group of documents.
Download Plain-Text Files to Run Your Own Analyses
You can download up to 1000 documents at a time as plain-text files for your personal use. You can run your own analyses on this data and combine it with other text sources to build custom text corpora.
What Content Is Available?
The Gale Digital Scholar Lab includes 160 million pages of Gale Primary Sources content from the following primary source digital archives:
17th and 18th Century Burney Collection
American Civil Liberties Union Papers, 1912-1990
Archives of Sexuality & Gender
British Library Newspapers
The Economist Historical Archive
Eighteenth Century Collections Online
Indigenous Peoples: North America
The Making of Modern Law: Foreign Primary Sources
The Making of Modern Law: Foreign, Comparative, and International Law, 1600-1926
The Making of Modern Law: Legal Treatises, 1800-1926
The Making of Modern Law: Primary Sources
The Making of Modern Law: Trials, 1600-1926
The Making of the Modern World
Nineteenth Century Collections Online
Nineteenth Century U.S. Newspapers
Sabin Americana, 1500-1926
The Sunday Times Digital Archive
The Times Digital Archive
The Times Literary Supplement Historical Archive
U.S. Declassified Documents Online
- View scans of original documents side-by-side with OCR plain text
- Work iteratively with your content set to refine your results
- Easily clean your data right in the Gale Digital Scholar Lab interface and create custom text-cleaning templates
- Work with materials and tools in other languages
How to Get Started
- Visit the Gale Digital Scholar Lab
- Log in with your Google or Microsoft OneDrive credentials (a personal account is needed so you can create and save personalized datasets)
- Create your dataset by searching through the materials in the Lab.
- Run analyses on your dataset right in the web browser and get immediate results, or download your dataset to your computer to run your own scripts.