Digital Archives and the DH Working Group on Nov. 4

To my delight, I can now announce that the next Digital Humanities Working Group meeting at UC Berkeley is on November 4 at 1pm in Doe Library, Room 223.

For the workshop, we have two amazing speakers for lightning talks. They are:

Danny Benett, MA Student in Folklore, will discuss the Berkeley folklore archive, which is making ~500,000 folklore items digitally accessible.

Adrienne Serra, Digital Projects Archivist at The Bancroft Library, will demo an interactive map in ArcGIS allowing users to explore digital collections about the Spanish and Mexican Land grants in California.

We hope to see you there! Please consider signing up (link), as we order pizza and like to have a rough headcount.

Flyer with D-Lab and Data & Digital Scholarship’s Digital Humanities Working Group, November 4 @ 1pm session.

The UC Berkeley Digital Humanities Working Group is a research community founded to facilitate interdisciplinary conversations in digital humanities and cultural analytics. It is a welcoming and supportive community for all things digital humanities.

The event is co-sponsored by the D-Lab and Data & Digital Scholarship Services.


New Resources in Literature

three quarter image of doe library
“Three quarter view of the East and North facades.” Daniel L. Lu, CC BY-SA 4.0

by Taylor Follett

Fall semester is always a time of fresh beginnings — new classes, new faces, and most excitingly for those of us at the library, access to new resources. We hope that the following new databases, books, journals, and much more will be of value to those studying literature. Here are some highlights for undergraduates, graduate students, and professors alike.



Faces in the Crowd

In the Bancroft Library Pictorial Unit, work continues on 115 panoramic Cirkut camera negatives being conserved and scanned as part of our NEH-funded work on the Edward A. Rogers Panama-Pacific International Exposition Photograph Collection.

Panoramic photo of crowd inside Fillmore Street Gate and the Panama Pacific Exposition, Nov. 2, 1915.
Fillmore Street Gate, San Francisco Day, Nov. 2, 1915. Scanned from a nitrate film negative, approximately 8 x 45 inches. (Edward A. Rogers Panama-Pacific International Exposition photograph collection, BANC PIC 2015.013:01134P—NNEG, The Bancroft Library, U.C. Berkeley.)

 

The digital images produced give the chance to peer into these panoramic scenes and pick out small details – and often our gaze is returned by characters in the crowd, caught some 102 years ago.

Bald man in crowd tipping his hat.
Detail of BANC PIC 2015.013:01134P—NNEG. A tip of the hat.

 

The panorama (pictured above) at the Fillmore Street Gate on San Francisco Day, November 2, 1915, is among the best crowd shots, and all the images in this posting are details from it. At center the throng recedes eastward into the distance, down the thoroughfare of popular amusements known as The Zone. At left the crowds fill the Avenue of Progress which leads toward the bay, past the Machinery Palace. At right are the entrance gates, with the ridge of the Pacific Heights neighborhood beyond.

Detail of crowd under the San Francisco Day banner at the entrance to The Zone.
Detail of BANC PIC 2015.013:01134P—NNEG

 

 

In the crowd there are so many marvelous faces (not to mention terrific hats!) that it is hard to select favorites.

Asian family in crowd.
Detail of BANC PIC 2015.013:01134P—NNEG

For a “world’s fair” there’s not a lot of diversity in this crowd. But this stylin’ family are holding their own.

###

Grinning man with a box camera.
Detail of BANC PIC 2015.013:01134P—NNEG

This fellow’s bound to have a good time, and he’s ready to make memories with his handy portable box camera at the ready.

###

Young boy in crowd selling balloons.
Detail of BANC PIC 2015.013:01134P—NNEG

This kid seems to have just made a balloon sale, but it’s serious work.

###

Stern older woman in hat with light veil.
Detail of BANC PIC 2015.013:01134P—NNEG

When mixing with hoi polloi, veils and a no-nonsense attitude are necessities for some. Even at a fair.

ESPECIALLY at a fair.

###

Smiling woman in a large hat with long feather decorations on it.
Detail of BANC PIC 2015.013:01134P—NNEG

This lady is smiling even though she’s enjoying neither an ice cream nor a cigar. Perhaps she knows her hat is at the cutting edge.

It will be over 40 years before Sputnik challenges her design-forward look.

###

Three odd looking men in crowd.
Detail of BANC PIC 2015.013:01134P—NNEG

Three distinct kinds of trouble.
Make that four.

###

Women in hats in crowd.
Detail of BANC PIC 2015.013:01134P—NNEG

With all the fine hats, how can we choose a winner? – But wait! – Never mind.

The wee chap on the right steals the show!

###

Woman in large hat with children, and a small girl with purse behind her.
Detail of BANC PIC 2015.013:01134P—NNEG

And this favorite auntie’s outstanding chapeau falls victim to another well-accessorized scene-stealer.

###

Work on the Rogers Panama-Pacific International Exposition collection will continue through June of 2018, at which time digital images from over 2,000 negatives will be put online. In the meantime, we will share favorites, along with project updates, on this Bancroft Pictorial Unit blog. Check back again!

James Eason, Archivist for Pictorial Collections, Bancroft Library


Panoramas Revealed: 1915 Panama-Pacific International Exposition Photographic Negatives

This year staff in the Bancroft Pictorial Unit have been hard at work housing and preparing to digitize glass plate negatives from the Edward A. Rogers Panama-Pacific International Exposition (PPIE) Photograph Collection. Supported by funding awarded by the National Endowment for the Humanities (NEH), about 2,000 glass negatives and 115 panoramic film negatives will be put in order, housed in archival sleeves and boxes, listed, and scanned. Although the work will not be complete and online until June 2018, great progress has been made, and we are starting to see some of the images produced by our digital imaging technicians.

The Rogers Collection was a gift presented in late 2014, just months before the centennial of the opening of San Francisco’s great world’s fair. In addition to the negatives (filling about 40 large boxes), there are also huge ledger books containing about 6,700 photographic prints. These originally served as a visual inventory of the negatives, which were mostly produced by the Cardinell-Vincent Company of San Francisco, official photographers for the PPIE. (Others are by the H.S. Crocker Company that previously held the PPIE photo contract.)

The Cardinell-Vincent photograph archive was broken up many decades ago, with much of it sold off in small auction lots in 1979; but Ed Rogers had collected this material well before that sale. In 2014 his was believed to be the largest PPIE photo collection in private hands – and certainly is the largest quantity of glass negatives known to have survived.

The most challenging images to conserve and digitize are the 115 panoramic negatives. These sweeping views and group portraits, made with a pivoting “Cirkut camera,” are on flammable cellulose nitrate film. Handling, transportation, and storage must meet stringent safety requirements. The rolled negatives were soiled from years of warehouse storage, so they are being cleaned by photograph conservators. They are so large (eight or ten inches high and up to 60 inches long!) that they need to be digitally photographed in segments, and these segments are digitally merged to create a file reproducing the original view.
The first scans from these panoramic negatives have been delivered, and they do not disappoint. The broad views over the bay-front PPIE site, just inside the Golden Gate, are stunning.

Panoramic photograph: overview of completed PPIE grounds and the bay, May 1915.
Panoramic overview of completed PPIE grounds and the bay, early May, 1915. Scanned from nitrate negative, approximately 8 x 42 inches. (Edward A. Rogers Panama-Pacific International Exposition photograph collection, BANC PIC 2015.013:00414P—NNEG, The Bancroft Library, U.C. Berkeley.)

 

There is enough detail present to zoom in and closely study segments of the view.

Detail of photograph, showing PPIE Tower of Jewels, from center portion of panorama.
Detail: Tower of Jewels and South Garden, with Alcatraz Island at right, from right of center of Panoramic overview of completed PPIE grounds and the bay, early May, 1915. (Edward A. Rogers Panama-Pacific International Exposition photograph collection, BANC PIC 2015.013:00414P—NNEG, The Bancroft Library, U.C. Berkeley.)

 

Detail of panoramic photograph, showing the Marin County coastline across the bay from the PPIE site.
Detail: Marin County coastline, where the north tower of the Golden Gate Bridge would later be built, from left side of Panoramic overview of completed PPIE grounds and the bay, early May, 1915. (Edward A. Rogers Panama-Pacific International Exposition photograph collection, BANC PIC 2015.013:00414P—NNEG, The Bancroft Library, U.C. Berkeley.)

 

Even the more prosaic group portraits offer great detail and often capture candid moments at the fringes of the crowd. Some of the crowd views are the most entertaining, and place the viewer in a festive moment captured 102 years ago.

 

Panoramic photograph: Turning on the Fountain of Energy, Opening Day Panama Pacific International Exposition, February 20, 1915
Turning on the Fountain of Energy, Opening Day Panama Pacific International Exposition, February 20, 1915. Scanned from nitrate negative, approximately 10 x 42 inches. (Edward A. Rogers Panama-Pacific International Exposition photograph collection, BANC PIC 2015.013:00006P—NNEG, The Bancroft Library, U.C. Berkeley.)

 

Detail of left portion of panoramic photo: crowd scene while "Turning on the Fountain of Energy, Opening Day"
Detail: Festival Hall and crowds, with large camera at right, from left portion of “Turning on the Fountain of Energy, Opening Day,” February 20, 1915. (Edward A. Rogers Panama-Pacific International Exposition photograph collection, BANC PIC 2015.013:00006P—NNEG, The Bancroft Library, U.C. Berkeley.)

 

As more digitization is completed we will share favorites, along with project updates, on this Bancroft Pictorial Unit blog. Stay tuned!

James Eason, Archivist for Pictorial Collections, Bancroft Library


Gallica Gives

"Ça, mon enfant, c'est du pain.. [That, my child, is bread...]" par Gottlob in L'Assiette au beurre (1901)

Online since 1997, Gallica remains one of the major digital libraries available for free on the Internet. With more than 12 million high-resolution digital objects from the collections of the Bibliothèque Nationale de France (BnF) as well as from hundreds of partner institutions, it includes books, journals, newspapers, manuscripts, maps, images, audio files, and more. The illustration above “Ça, mon enfant, c’est du pain.. [That, my child, is bread…]” by Fernand-Louis Gottlob was published in one of the first issues of the weekly satirical magazine L’Assiette au beurre (1901-1936) which is also held in print at UC Berkeley. Committed to the ever-evolving needs of its user community, Gallica’s social media outlets include Facebook, Twitter, Pinterest and even a BnF app.


Library Leaders Forum 2016

The aerial shot of the group at the Library Leaders Forum 2016 by Brad Shirakawa

On October 26-28, I had the honor of attending the Library Leaders Forum 2016, which was held at the Internet Archive (IA). This year’s meeting was geared towards envisioning the library of 2020. October 26th was also IA’s 20th anniversary. I joined my Web Science and Digital Libraries (WS-DL) Research Group in celebrating IA’s 20 years of preservation by contributing a blog post with my own personal story, which highlights one aspect of the importance of Web preservation for the Egyptian Revolution. More personal stories about Web archiving can be found on the WS-DL blog.

Brewster Kahle opens the Library Leaders Forum 2016

In the Great room at the Internet Archive, Brewster Kahle, the Internet Archive’s Founder, kicked off the first day by welcoming the attendees. He began by highlighting the importance of openness, sharing, and collaboration for the next generation. During his speech he raised an important question, “How do we support datasets, the software that come with it, and open access materials?” According to Kahle, the advancement of digital libraries requires collaboration.

IA’s Golden Floppy

After Brewster Kahle’s brief introduction, Wendy Hanamura, the Internet Archive’s Director of Partnership, highlighted parts of the schedule and presented the rules of engagement and communication:

  • The rule of 1 – Ask one question, answer one question.
  • The rule of n – If you are in a group of n people, speak 1/n of the time.

Before giving the microphone to the attendees for their introductions, Hanamura gave a piece of advice: “be honest and bold and take risks”. She then informed the audience that “The Golden Floppy” award would be given to attendees who shared bold or honest statements.

Next was our chance to get to know each other through self-introductions. We were supposed to talk about who we are, where we are from and finally, what we want from this meeting or from life itself. The challenge was to do this in four words.

“Partnership is important for advancing the library system”, Sylvain Belanger.

After the introductions, Sylvain Belanger, the Director of Preservation of Library and Archives in Canada, talked about where his organization will be heading in 2020. He mentioned the physical side of the work they do in Canada to show the challenges they experience. They store, preserve, and circulate over 20 million books, 3 million maps, 90,000 films, and 500 sheets of music.

“We cannot do this alone!” Belanger exclaimed. He emphasized how important partnership is to advancing the library field. He mentioned that the Library and Archives in Canada is looking to enhance preservation and access as well as looking for partnerships. They would also like to introduce the idea of innovation into the mindset of their employees. According to Belanger, the Archives’ vision for the year 2020 includes consolidating their expertise as much as they can and also learning how other people do their work in digitization and Web archiving.

After Belanger’s talk, we split up into groups of three to meet other people we didn’t know so that we could exchange knowledge about what we do and where we came from. Then pairs of groups joined to form groups of six, which exchanged their visions, challenges, and opportunities. Most of the attendees agreed on the need for growth and accessibility of digitized materials. Some of the challenges were funding, ego, power, culture, etc.

Chris Edwards from the Getty Research Institute.

Chris Edwards, the Head of Digital Services at the Getty Research Institute, talked about what they are doing, where they are going, and the impact of their partnership with the IA. Edwards mentioned that the uploads by the IA are harvested by HathiTrust and the Defense Logistics Agency (DLA). This allows them to distribute their materials. Their vision for 2020 is to continue working with the IA, expand the Getty research portal, and digitize everything they have and make it available for everyone, anywhere, all the time. They also intend to automate metadata generation (OCR, image recognition, object recognition, etc.), make archival collections accessible, and do 3D digitization of architectural models. They will then join forces with the International Image Interoperability Framework (IIIF) community to develop the capability to represent these objects. He also added that they want to help the people who do not have the ability to do it on their own.

Wendy Hanamura is presenting the IA’s strategic plan for 2015-2020

After lunch, Wendy Hanamura walked us quickly through the Archive’s strategic plan for 2015-2020 and IA’s tools and projects. Some of these plans are:

  • Next generation Wayback Machine
    • Test pilot with Mozilla so they suggest archived pages for the 404
    • Wikimedia link rot
  • Building libraries together
  • The 20 million books
    • Table top scribe
    • Open library and discovery tool
    • Digitization supercenter
    • Collaborative circulation system
  • Television Archive — Political ads
  • Software and emulation
  • Proprietary code
  • Scientific data and Journals – Sharing data
  • Music — 78’s

“No book should be digitized twice!” This is how Wendy Hanamura ended her talk.

Then we had a chance to get our hands on the new tools from the IA and its partners at multiple makerspace stations. There were plenty of interesting projects, but I focused on the International Research Data Commons, by Karissa McKelvey and Max Ogden from the Dat Project. Dat is a grant-funded project that introduces open source tools to manage, share, publish, browse, and download research datasets. Dat supports peer-to-peer distribution systems (e.g., BitTorrent). Ogden mentioned that their goal is to build a tool for data management that is as easy to use as Dropbox and also has a version control system like Git.

After a break, Jeffrey MacKie-Mason, the University Librarian of UC Berkeley, interviewed Brewster Kahle about the future of libraries and online knowledge. The discussion covered copyright, digitization, prioritization of archiving materials, cost of preservation, avoiding duplication, accessibility and scale, IA’s plans to improve the Wayback Machine, and many other issues related to digitization and preservation. At the end of the interview, Kahle announced his white paper, entitled “Transforming Our Libraries into Digital Libraries”, and solicited feedback and suggestions from the audience.

https://twitter.com/tripofmice/status/791790807736946688

https://twitter.com/tripofmice/status/791786514497671168

Brad Shirakawa
The photographer Brad Shirakawa taking an aerial shot in the Great room.

At the end of the day, we had an unusual and creative group photo by the great photographer Brad Shirakawa who climbed out on a narrow plank high above the crowd to take our picture.

On day two the first session I attended was a keynote address by Brewster Kahle about his vision for the Internet Archive’s Library of 2020, and what that might mean for all libraries.

Heather Christenson from HathiTrust.

Heather Christenson, the Program Officer for HathiTrust, talked about where HathiTrust is heading in 2020. Christenson started by briefly explaining what HathiTrust is and why it is important for libraries. Christenson said that HathiTrust’s primary mission is preserving print and digital collections, improving discovery and access by offering text search and bibliographic data APIs, and building a comprehensive collection of US federal documents. Christenson mentioned that they did a survey of their membership and found that people want them to focus on books, videos, and text materials.

A panel discussion about the Legal Strategies and Practices for libraries.

Our next session was a panel discussion about the Legal Strategies and Practices for libraries by Michelle Wu, the Associate Dean for Library Services and Professor of Law at the Georgetown University Law Center, and Lila Bailey, the Internet Archive’s Outside Legal Counsel. Both speakers shared real-world examples and practices. They mentioned that the law has never been clearer and digitizing has never been safer, but the open question is access. They advised libraries to know the practical steps before going to the institutional counsel. “Do your homework before you go. Show the usefulness of your work, and have a plan for why you will digitize, how you will distribute, and what you will do with the takedown request.”

Tom Rieger talks about the LOC’s 2020 strategic plan.

After the panel, Tom Rieger, the Manager of the Digitization Services Section at the Library of Congress (LOC), discussed the 2020 vision for the Library of Congress. Rieger spoke of the LOC’s 2020 strategic plan. He mentioned that their primary mission is to serve the members of Congress, the people in the USA, and researchers all over the world by providing access to collections and information that can assist them in decision making. To achieve their mission, the LOC plans to collect and preserve born-digital materials and provide access to these materials, as well as providing services to people for accessing these materials. They will also migrate all the formats to an easily manageable system, actively engage in collaboration with many different institutions to empower the library system, and adopt new methods for fulfilling their mission.


 

In the evening, there were different workshops about tools and APIs that IA and their partners provided. I was interested in the RDM workshop by Max Ogden and Roger Macdonald. I wanted to explore the ways we can support and integrate this project into the UC Berkeley system. I gained more information about how the Dat Project works through a live demo by Ogden. We also learned about the partnership between the Dat Project and the Internet Archive to start storing scientific data and journals at scale.

Notes from “Long-Term Storage for Research Data Management” session.

We then formed small groups around different topics in our field to discuss what challenges we face and generate a roadmap for the future. I joined the “Long-Term Storage for Research Data Management” group to discuss the challenges and visions of storing research data and what libraries and archives should do to make research data more useful. We started by introducing ourselves. We had Jefferson Bailey from the Internet Archive, Max Ogden and Karissa McKelvey from the Dat Project, Drew Winget from Stanford Libraries, Polina Ilieva from the University of California San Francisco (UCSF), and myself, Yasmin AlNoamany.

Some of the issues and big-picture questions that were addressed during our meeting:

  • The long-term storage for the data and what preservation means to researchers.
  • What is the threshold for reproducibility?
  • What do researchers think about preservation? Does it mean 5 years, 15 years, etc.?
  • What is considered a dataset? Harvard considers anything/any file that can be interpreted as a dataset.
  • Do librarians have to understand the data to be able to preserve it?
  • What is the difference between storage and preservation? Data can be stored, but long-term preservation needs metadata.
  • Do we have to preserve everything? If we open it to the public to deposit their huge datasets, this may result in noise. For the huge datasets what should be preserved and what should not?
  • Privacy and legal issues about the data.

Principles of solutions

  • We need to teach researchers how to generate metadata and the metadata should be simple and standardized.
  • Everything that is related to research reproducibility is important to be preserved.
  • Assigning DOIs to datasets is important.
  • Secondary research – taking two datasets and combining them to produce something new. In digital humanities, many researchers use old datasets.
  • There is a need to fix the 404 links for datasets.
  • There should be an easy way to share data between different institutions.
  • Archives should have rules for the metadata that describe the dataset the researchers share.
  • The network should be neutral.
  • Everyone should be able to host data.
  • Versioning is important.

Notes from the other Listening posts:

Polina Ilieva from UCSF wrapped up the meeting.

At the end of the day, Polina Ilieva, the Head of Archives and Special Collections at UCSF, wrapped up the meeting by giving her insight and advice. She mentioned that to accomplish their 2020 goals and vision, there is a need to collaborate and work together. Ilieva said that the collections should be available and accessible for researchers and everyone, but there is a challenge in assessing who is using these collections and in quantifying the benefits of making them available. She announced that they would donate all their microfilms to the Internet Archive! “Let us all work together to build a digital library, serve users, and attract consumers. Library is not only the engine for search, but also an engine for change, let us move forward!” This is how Ilieva ended her speech.

It was an amazing experience to hear about the 2020 vision of the libraries and be among all of the esteemed library leaders I have met. I returned with inspiration and enthusiasm for being a part of this mission and also ideas for collaboration to advance the library mission and serve more people.

–Yasmin AlNoamany