digital humanities

PhiloBiblon 2025 n. 1 (enero): Final de una época

Posted on January 31, 2025February 1, 2025 by Charles Faulhaber

La semana pasada colgué las versiones últimas y finales de BETA, BITECA y BITAGAP en la web de PhiloBiblon de la Universitat Pompeu Fabra. Desde este momento no habrá más cambios en los datos presentados allí. Dentro de unos meses, un PhiloBiblon nuevo y renovado aparecerá en FactGrid:PhiloBiblon en la universidad alemana de Jena, donde compartirá espacio virtual con otros cuarenta proyectos históricos, desde los estudios cuneiformes hasta la época nazi en Alemania.

Es el final de un largo periplo que empezó en 1975 con el proyecto Dictionary of the Old Spanish Language de la University of Wisconsin, encabezado por Lloyd Kasten, con quien había estudiado yo ecdótica unos diez años antes. Fue el primer proyecto de humanidades digitales en el ámbito hispanico. Kasten fue secundado por su joven colega John Nitti, el arquitecto del entorno informático del proyecto. Para garantizar la autenticidad del corpus de español antiguo, se necesitaba un repertorio de manuscritos españoles escritos antes de 1501. Tal repertorio no existió y pusieron a ayudantes de investigación a vaciar dos obras, el Catálogo de los manuscritos castellanos de la Real Biblioteca de El Escorial, del benemérito, y martirizado, padre Julián Zarco Cuevas; y la Bibliografía de la literatura hispánica del igualmente benemérito José Simón Díaz. Identificaron 966 obras y Kasten y Nitti imprimieron el resultado como Bibliography of Old Spanish Texts en 1975. Siguió una segunda edición en 1977, con 1869 fichas. Al año siguiente yo me encontraba en la Hispanic Society, redactando su catálogo de manuscritos medievales y enviando adiciones y correcciones a BOOST. En 1981 Kasten y Nitti me ofrecieron la dirección de BOOST como proyecto autónomo, y lo demás, como dicen, es historia.

Pronto tuve un colaborador, un jovencísimo Ángel Gómez Moreno (UCM), que acababa de pasar un año de estudios de posgrado en Wisconsin. Al volver a España se lanzó con entusiasmo a la tarea de repasar los ficheros y catálogos (en papel, por supuesto), de la BNE, la Biblioteca de Palacio, la RAE y la RAH, siempre en busca de nuevos manuscritos para BOOST. En los años 80 estudiantes de carrera y posgrado sirvieron como ayudantes de investigación, primero María Morrás (UPF), y en los 90 Ángela Moll Dexeus y Antonio Cortijo (UC Santa Barbara), durante su paso por las aulas de Berkeley. Ya desde 1993 BOOST era BETA, la Bibliografía Española de Textos Antiguos en CD-ROM y en la red en 1997. En 2000 Ángel me mandó como becario por dos años a Óscar Perea Rodríguez; y desde entonces Óscar ha sido mi mano derecha en el proyecto, organizando los seminarios de PhiloBiblon, primero en San Millán de la Cogolla y luego en Madrid. Más recientemente ha trabajado como ayudante en Berkeley Heather Bamford (George Washington University), mientras que, en España, en la UCM, Álvaro Bustos ha participado en varias campañas de investigación conmigo y ha reclutado nuevos miembros del equipo de BETA.

Kasten y Nitti fueron responsables también de la Bibliography of Old Catalan Texts (BOOCT, 1985), base de datos gemela a BOOST, compilada por Beatrice Concheff (†), antigua alumna de la University of Wisconsin y que, en aquel tiempo, ejercía como profesora de inglés en una escuela de lenguas de Barcelona. Era una aficionada a la literatura catalana, pero no era experta en ella en términos académicos; por ello, la inmensa mayoría de sus citas a obras y manuscritos medievales provenía de fuentes secundarias. Y además, en el año de 1989, por desgracia, Concheff se encontraba con una salud muy delicada y era consciente de que no podría dedicarse a la compilación de una segunda edición de BOOCT.

En una de las sesiones de trabajo del congreso de la Asociación Internacional de Hispanistas de Barcelona de 1989, mi amigo Francisco Marcos Marín me presentó a Vicenç Beltrán. Expuesto el caso a Vicenç, vio inmediatamente la importancia de seguir adelante con BOOCT. En seguida Vicenç embarcó en el proyecto a su jovencísima alumna, doctoranda en aquel entonces, Gemma Avenoza i Vera (†). Fue Gemma la que enseguida quedó al cargo del mismo. Era ya una codicóloga consumada, como se puede comprobar en su tesis doctoral, Repertori dels manuscrits en llengües romàniques conservats a biblioteques barcelonines. Además, Gemma era de los escasos expertos en filología de la época que sabía manejar un ordenador, habilidad absolutamente necesaria para trabajar con PhiloBiblon. Ambos, Gemma y Vicenç, romanistas al estilo antiguo y expertos no sólo en las literaturas medievales ibéricas, sino también en las de Francia e Italia, comprobaron la utilidad del (entonces recién bautizado) PhiloBiblon, pues podría facilitar la tarea de catalogación y descripción de los textos y testimonios del corpus de literatura medieval ibérica en lenguas castellana, catalana, portuguesa y gallego-portuguesa.

A los pocos meses de ese primer encuentro en Barcelona, en el otoño de 1989 Gemma ya pudo visitarme en Berkeley para adentrarse en el arcano mundo de PhiloBiblon y encargarse, junto con Vicenç, de BOOCT, que pronto cambiaría su nombre en BITECA (Bibliografia de Textos Catalans Antics), de forma paralela a BETA. Desde entonces hasta su demasiado temprana muerte en 2021, Gemma fue la mano directriz de BITECA, pronto secundada por Lourdes Soriano, alumna tanto de Gemma como de Vicenç. Durante casi 20 años, pues, Gemma, Lourdes y yo estuvimos en contacto constante. Las dos supieron atraer a más de medio centenar de colaboradores y ayudantes de investigacíon. Los miembros actuales del equipo de BITECA son Joan Mahiques Climent (Universitat Jaume I), Joan M. Perujo (Universitat d’Alacant), Helena Rovira Cerdà (Universidad de Murcia), Glòria Sabaté (Universitat de Barcelona) y Ferran Triadó Sales, (Universitat de Barcelona), todos ellos alumnos o antiguos alumnos de Gemma y Lourdes.

Ya por la misma época, en 1988, entablé discusiones con mi querido colega y vis-à-vis (literalmente) en el departamento de español y portugués de Berkeley, Arthur Askins, sobre la posibilidad de crear una base de datos paralalela a BOOST y BOOCT. Se llamaría Bibliography of Old Portuguese Texts (BOOPT) y formaría un nuevo tomo en la “Bibliographic Series” de la Hispanic Seminary of Medieval Studies. Pero ya la época de publicación en papel había pasado. Con una subvención de la Sociedad Estatal del Quinto Centenario, BETA, BITECA y el recien bautizado BITAP (Bibliografia de Textos Antigos Portugueses) saldría en disco CD-ROM en 1993. Más tarde, el nombre BITAP se cambiaría en BITAGAP (Bibliografia de Textos Antigos Galegos e Portugueses).

Arthur había sido testigo ocular de mis investigaciones con BOOST desde 1981 y se había dado cuenta de la utilidad de las bases de datos en sus propias investigaciones sobre la biblioteca de Fernando Colón. Estaba intrigado, pero se dio cuenta desde el principio que esto tendría que ser un proyecto de equipo. Reclutó a sus amigos y antiguos compañeros de estudios de Berkeley, Harvey Sharrer (†), profesor en la University of California, Santa Bárbara, y Martha E. Schaffer, en aquel entonces profesora la University of Chicago y actualmente en la University of San Francisco.

Iniciados sus viajes de investigación anuales en Portugal, muy pronto los tres persuadieron a Aida Fernanda Dias (†), profesora de la Universidade de Coimbra, que se uniera al equipo. Luego, comenzando en 2008, se incorporaron Pedro Pinto (Universidade Nova de Lisboa), con su olfato extraordinario para encontrar manuscritos de interés, Cristina Sobral (Universidade de Lisboa), Filipe Alves Moreira (Universidade do Porto, actualmente Universidade Aberta), Maria de Lurdes Rosa (Universidade Nova de Lisboa), Ricardo Pichel Gotérrez (Universidade de Santiago y actualmente Universidad de Educación a Distancia), Mariña Arbor (Universidade de Santiago y Diana Fontão (Universidade de Porto / Universidad de Salamanca).

Como en los casos de BETA y BITECA, el equipo de BITAGAP ha contado con al menos una centena de buenos colaboradores, entre estudiantes y colegas del mundo entero.

PhiloBiblon no sólo ha sido creación de las personas mencionadas aquí, sino de todos aquellos colegas que nos han escrito pidiendo u ofreciendo información sobre tal o cual texto, manuscrito o persona. Por y para ellos hemos dedicado tanto esfuerzo durante más de cuarenta años, el mismo esfuerzo precisamente que todos los miembros del equipo vamos a continuar haciendo en esta nueva etapa que se abre ante nosotros.

Charles B. Faulhaber
University of California, Berkeley

Referencias:

Faulhaber, Charles B., PhiloBiblon, pasado, presente, futuro: Historia de un proyecto de investigación, San Millán de la Cogolla: Congreso Internacional. Códices literarios españoles (Edad media) 2007-11-29

_____, PhiloBiblon: Pasado y futuro, (Incipit 29 2009: pp. 191-200)

_____, PhiloBiblon, Information Technology, and Medieval Spanish Literature: A Balance Sheet, Bern: Humanitats a la Xarxa: Món Medieval – Humanities on the Web: The Medieval World 2014: pp. 15-43

_____, PhiloBiblon and the Semantic Web. Notes for a Future History, València: La Literatura Medieval Hispánica en la Imprenta (1475-1600) 2016: pp. 75-93

____. Harvey L. Sharrer (1940-2024). PhiloBiblon blog 2024-10-10

Dias, Aida Fernanda, Arthur Lee-Francis Askins, Harvey L. Sharrer, A ‘Bibliografia de Textos Antigos Galegos e Portugueses’ na Internet, Lisboa: Revista Portuguesa de História do Livro 1 n. 2 1998: pp. 173-80)

Martín Molares, Mónica PhiloBiblon 2020 n. 1 (febrero): Crónica del V Seminario Internacional PhiloBiblon-BNE-UCM (Madrid, junio de 2019), (PhiloBiblon Blog 2020-02-28)

Perea Rodríguez, Óscar, y Charles B. Faulhaber, Esbozos sobre la evolución y el futuro de un pionero de las humanidades digitales hispánicas: el proyecto de PhiloBiblon, Revista Digital Universitaria 18 n. 1 2017

Schaffer, Martha E. Homenagem a Aida Fernanda Dias oferecida pelos seus colegas de Philobiblon, eHumanista 31 2015: pp. i-v, 1-275)

Sharrer, Harvey L., BITAGAP (Bibliografia de Textos Antigos Galegos e Portugueses): um armazém da memória histórica, Madrid: (Tenh’eu que mi fez el i mui gran ben. Estudos sobre cultura escrita medieval dedicados a Harvey L. Sharrer 2022: pp. 39-67)

_____, A BITAGAP no seu 30^o aniversário: a expansão de dados e expectativas para o futuro, Berlin: O Medievalismo no século XXI 2020: pp. 157-68)

_____, The BITAGAP project since 1988 – Expansion of the corpus of texts and important discoveries, Bern: Humanitats a la Xarxa: Món Medieval – Humanities on the Web: The Medieval World 2014: pp. 169-83

Digital Archives and the DH Working Group on Nov. 4

Posted on October 23, 2024 by Bee Lehman

To my delight, I can now announce that the next Digital Humanities Working Group at UC Berkeley is November 4 at 1pm in Doe Library, Room 223.

For the workshop, we have two amazing speakers for lightning talks. They are:

Danny Benett, MA Student in Folklore, will discuss the Berkeley folklore archive which is making ~500,000 folklore items digitally accessible.

Adrienne Serra, Digital Projects Archivist at The Bancroft Library, will demo an interactive map in ArcGIS allowing users to explore digital collections about the Spanish and Mexican Land grants in California.

We hope to see you there! Do consider signing up (link) as we order pizza and like to have loose numbers.

The UC Berkeley Digital Humanities Working Group is a research community founded to facilitate interdisciplinary conversations in digital humanities and cultural analytics. It is a welcoming and supportive community for all things digital humanities.

The event is co-sponsored by the D-Lab and Data & Digital Scholarship Services.

A&H Data: What even is data in the Arts & Humanities?

Posted on September 24, 2024September 25, 2024 by Bee Lehman

This is the first of a multi-part series exploring the idea and use of data in the Arts & Humanities. For more information, check out the UC Berkeley Library’s Data and Digital Scholarship page.

Arts & Humanities researchers work with data constantly. But, what is it?

Part of the trick in talking about “data” in regards to the humanities is that we are already working with it. The books and letters (including the one below) one reads are data, as are the pictures we look at and the videos we watch. In short, arts and humanities researchers are already analyzing data for the essays, articles, and books that they write. Furthermore, the resulting scholarship is data.

For example, the letter below from Bancroft Library’s 1906 San Francisco Earthquake and Fire Digital Collection on Calisphere is data.

George Cooper Pardee, “Aid for San Francisco: Letter from the Mayor in Oregon,”
April 24, 1906, UC Berkeley, Bancroft Library on Calisphere.

One ends up with the question “what isn’t data?”

The broad nature of what “data” is means that instead of asking if something is data, it can be more useful to think about what kind of data one is working with. After all, scholars work with geographic information; metadata (e.g., data about data); publishing statistics; and photographs differently.

Another helpful question is to consider how structured it is. In particular, you should pay attention to whether the data is:

unstructured
semi-structured
structured

The level of structure informs us how to treat the data before we analyze it. If, for example, you have hundreds of of images, you want to work with, it’s likely you’ll have to do significant amount of work before you can analyze your data because most photographs are unstructured.

For example, with this picture of a ceramic hedgehog, the adorable animal, the photograph, and the metadata for the photograph are all different kinds of data. Image: Zde, Ceramic Rhyton in the Form of a Hedgehog, 14. to 13. century BCE, Photograph, March 15, 2014, Wikimedia Commons. | Creative Commons Attribution-Share Alike 3.0 Unported.

In contrast, the letter toward the top of this post is semi-structured. It is laid out in a typical, physical letter style with information about who, where, when, and what was involved. Each piece of information, in turn, is placed in standardized locations for easy consumption and analysis. Still, to work with the letter and its fellows online, one would likely want to create a structured counterpart.

Finally, structured data is usually highly organized and, when online, often in machine-readable chart form. Here, for example, are two pages from the Polk San Francisco City Directory from 1955-1956 with a screenshot of the machine-readable chart from a CSV (comma separated value) file below it. This data is clearly structured in both forms. One could argue that they must be as the entire point of a directory is for easy of information access and reading. The latter, however, is the one that we can use in different programs on our computers.

Page from San Francisco city directory with columns listing businesses with their addresses.

Screenshot of excell sheet with publisher addresses in columns

R.L. Polk & Co, Polk’s San Francisco City Directory, vol. 1955–1956 (San Francisco, Calif. : R.L. Polk & Co., 1955),
Internet Archive. | Public Domain.

This post has provided a quick look at what data is for the Arts&Humanities.

The next will be looking at what we can do with machine-readable, structured data sets like the publisher’s information. Stay tuned! The post should be up in two weeks.

Correspondance complète de Rousseau ONLINE

Posted on August 12, 2024February 19, 2026 by Claude Potts

engraving of Jean-Jacques Rousseau — *Image: Jean-Jacques Rousseau, 1712-1778. Austrian National Library*

In partnership with the Voltaire Foundation, the Correspondance complète de Rousseau ONLINE makes Ralph Leigh’s critical edition in 52 volumes in the original French-language available as an ebook collection for the first time. The digital corpus gathers together all 8,000 letters written to and by one of the most important figures of eighteenth-century intellectual history, as well as the correspondence between third parties relating to the writer and his time. Drafts and copies have been collated against the original manuscripts and all variants reproduced. The extensive annotations identify individuals, events and places, explain the linguistic usages of the eighteenth century, give bibliographical information and clarify obscure allusions.

This library purchase was made possible with the generous support from the Archie & Harriett Maclean Endowed Fund for French Culture.

Oral History Project Wins Autry Public History Prize

Posted on November 14, 2023November 16, 2023 by UC Berkeley Library

Side-by-side portraits of Emi Kuboyama and Todd Holmes

The UC Berkeley Oral History Center (OHC) is thrilled to announce that OHC historian Todd Holmes and project partner Emi Kuboyama from Stanford University have won the 2023 Autry Public History Prize for their digital project, Redress: An Oral History. The award is given by the Western History Association for the best project in public history. Released to the public in 2022, the project documents the history of Japanese American Redress through oral histories and a documentary film, which are featured with related historical resources on a dedicated educational website.

Portrait of Emi Kuboyama — Emi Kuboyama, Project Creator and Interviewer

Holmes and Kuboyama began the project in 2018 with the initial goal of documenting the history of the Office of Redress Administration (ORA), the little-known agency charged with administering redress by the Civil Liberties Act of 1988. Emi Kuboyama, the principal creator of the project, had a direct link to the agency and its work. As a native of Hawaii, she was no stranger to the history of Japanese American incarceration or the impact that dark period still held in Japanese American communities. She also began her legal career with the agency in 1994, an experience that had a profound impact on her personally and professionally.

In 2017, Kuboyama attended the OHC’s Advanced Oral History Institute to explore how oral history could help document the historic redress program and the work of the ORA. There she met OHC historian Todd Holmes and the two agreed to partner on the project. With the support of a Japanese American Confinement Sites grant from the National Parks Service, they conducted over a dozen interviews with former ORA staff, as well as community leaders affiliated with the program. The recordings and transcripts of those interviews are now housed at the Densho Digital Repository. Upon the completion of the oral history interviews, Holmes and Kuboyama recognized the need to put the history of the ORA into conversation with the experience of the Japanese American community in its forty-six-year journey from internment to redress. With the generous support of the Henri and Tomoye Takahashi Foundation, they enlisted the help of filmmaker Jon Ayon. The collaboration resulted in the film, Redress, which offers the first in-depth look at the history of Japanese American redress as told by the community members who took part in the program, and the government professionals who administered it.

Portrait of Todd Holmes — Todd Holmes, Project Co-Creator and Videographer

The last part of this digital project was to create a website that would not only serve as a home for the oral histories and film, but also an educational space for students and the public to learn more about the history of redress. Created by Todd Holmes and Heidi Holmes, the website features two historical pages that supplement the film and oral histories, as well as a resources page that points visitors to related historical material such as books, films, and oral history collections. Since the project’s release in fall 2022, the website has received over 43,000 visitors.

The prize was awarded to Holmes and Kuboyama in October 2023 at the annual Western History Association Conference. In the awards program, the Autry Committee praised the Redress project as “an excellent model of professional public history practice that documents a moment in Western American History that has particular significance for today’s conversations about reparations within other marginalized groups.” The committee also applauded how the project “showcases the power of the medium of oral history.”

The Oral History Center congratulates Todd Holmes, Emi Kuboyama, and their partners on an outstanding project and contribution. For more on the history of Japanese American Redress, visit the project website. And to learn more about the Japanese American experience and the legacy of WWII, see the new oral histories of the OHC’s Japanese American Intergenerational Narratives project, which are featured in the newest season of The Berkeley Remix podcast.

Resources

Redress: An Oral History website

Oral History Center’s Japanese American Intergenerational Narratives Oral History Project

The Berkeley Remix podcast: Season 8: “‘From Generation to Generation’: The Legacy of Japanese American Incarceration”