On March 9, 2021, the Center for Southeast Asian Studies, Institute of East Asian Studies, the Institute of South Asia Studies, and the Human Rights Center at UC Berkeley hosted the online symposium Scholar-Activism and the Myanmar Resistance. The event invited scholar-activists to analyze and strategize for resistance to Myanmar’s military coup. The Office of Scholarly Communication Services collaborated with Dr. Hilary Faxon, Ciriacy-Wantrup Postdoctoral Fellow at UC Berkeley, to organize an afternoon workshop to explore the law, ethics, methods, and goals of archiving social media coverage of the coup.
Faxon highlighted that in the months since the military seized power on February 1, the internet has become a key domain of struggle in Myanmar. The military has cut off internet access and (before being banned) used Facebook to disseminate misinformation. Meanwhile, democracy activists have used social media alongside traditional tactics of street protests and general strikes to resist the regime.
The workshop brought together a diverse group of participants from across and beyond campus with perspectives from human rights, research and journalism, including WITNESS and Berkeley’s Human Rights Investigation Lab. Stacy Reardon, Literatures and Digital Humanities Librarian, discussed services and workshops offered by Digital Humanities at Berkeley, as well as tools used to conduct DH research, such as the Wayback Machine, Conifer, 4k download, Adobe Bridge, and others.
Workshop discussions were centered around a commitment to a shared ethics of care approach to using, sharing, and archiving information social media content related to the coup. The ethics of care framework suggests that what we do as information collectors or analyzers will affect other people, particularly when people have less structural power, and according to the ethics of care, we should care about that. This becomes immediately apparent when deciding whether or how to collect, process, and share potentially sensitive social media posts, images, and videos from the Myanmar coup, especially when doing so could have dire consequences for activists who are the subjects of those posts.
During the workshop, we talked about how the Library has adopted a form of ethics of care in our approach to making decisions about what collection materials we’ll digitize and put online. Our version of ethics of care is framed as a balancing principle: that is, we look to whether the value to researchers, the public, or cultural communities in digitizing and sharing the content outweighs the potential for harm or exploitation of people, resources, or knowledge.
Several takeaways emerged by the end of the workshop discussion:
- Protecting and defending human rights: Archiving material from social media—including videos, photos, and live streams—might help ensure perpetrators of violence are held accountable, but the production and circulation of such materials can also be highly-incriminating for media creators and platform users.
- Collecting is collaborative: Usage of archives is bound up with the intentions of those creating material, and so archiving requires an ongoing, bi-directional conversation between those creating content and those doing the archiving.
- Circumstances change: Both ethical and organizational approaches should be discussed and decided in advance of archiving. But expect situations to change – what is safe and straightforward to keep today may be more risky tomorrow.
- Capturing versus sharing: These are different processes, and “archiving” does not necessarily have to entail both. The benefits and risks associated with collecting data are distinct from those associated with sharing data or making it publicly available, so these processes should be considered separately.
- Law and ethics: Regardless of what is allowed under U.S. copyright law, there may be other contracts and terms of service that restrict what you can do with materials. Moreover, collecting voluntarily-released data may not violate legal privacy rights, but may present ethical questions.
- Data security: Develop a Data Management Plan that addresses organization and protection both during archiving, and after the project is completed. Consider a special purpose account for collaborations and data sharing.
- Data hygiene: Don’t collect more than you need.
- Practical strategies: Tools may depend on the specific goals of a researcher and the scale of the project. It is important to ask what, precisely, you mean when you say “archiving,” and what the purpose of creating your archive might be.
- Seek out a community of practice to support and situate your efforts.
We hope the workshop helped researchers to better understand the legal and ethical considerations in collecting, processing, and sharing potentially sensitive social media content of events like the Myanmar resistance. The Library and a broad community of supporters are here to help scholars address these challenges and equip them to proceed with confidence, care, and sound practices.
This update is cross-posted from the Building LLTDM blog.
On June 23-26, we welcomed 32 digital humanities (DH) researchers and professionals to the Building Legal Literacies for Text Data Mining (Building LLTDM) Institute. Our goal was to empower DH researchers, librarians, and professional staff to confidently navigate law, policy, ethics, and risk within digital humanities text data mining (TDM) projects—so they can more easily engage in this type of research and contribute to the further advancement of knowledge. We were joined by a stellar group of faculty to teach and mentor participants. Building LLTDM is supported by a grant from the National Endowment for the Humanities.
Why was the Institute needed?
Until now, humanities researchers conducting text data mining in the U.S. have had to maneuver through a thicket of legal issues without much guidance or assistance. As an example, take a researcher scraping content about Egyptian artifacts from online sites or databases, or downloading videos about Egyptian tomb excavations, in order to conduct automated analysis about religion or philosophy. The researcher then shares these content-rich data sets with others to encourage research reproducibility or enable other researchers to query the data sets with new questions. This kind of work can raise issues of copyright, contract, and privacy law. It can also raise concerns around ethics, for example, if there are plausible risks of exploitation of people, natural or cultural resources, or indigenous knowledge.
Moving an interactive, design-thinking Institute online
After months of preparation, we had been looking forward to working and learning together at UC Berkeley, but the world had other plans for our Institute. Due to the global health crisis, we had to transform our planned in-person, intensive workshop into an interactive and relevant remote experience.
How did we do this? The pandemic meant we had to transition everything online, which of course presents challenges for a design-thinking framework. We are thrilled that our approach to interactive remote pedagogy was successful! (You can check out the schedule and framework in our Participant Packet.) The substantive content was pre-recorded and delivered in a flipped classroom model. Faculty created a series of short videos, and shared readings relevant to the legal literacies. We also provided the video transcripts and slides to participants to promote accessibility and accommodate multiple learning styles.
We used Zoom to meet synchronously for discussion in groups of various sizes. We used Slack for asynchronous communication, and interactive tools such as Mural for design thinking exercises like journey mapping so that everyone could live edit and collaborate. We capped each day with a “happy half hour” on Zoom as an informal way to get to know each other a little better, even from afar.
We also relied on an institute moderator and daily writing exercises to reinforce the design-thinking stages and learning outcomes. Each night, we reviewed the participants’ free-writes and began the next morning by reflecting back to the participants the themes from what they had shared.
Reflections on goals: social justice & effective empowerment
One of our priorities for the Institute was to invite a diverse pool of participants, including those involved in social justice research, in order to maximize the public value impact of Building LLTDM. We looked for demonstrated commitments to diversity and equity but could hardly have imagined the breadth and depth of experiences that applicants were willing to share. The selected participants research everything from understanding “place” data from community histories of historic African American settlements to the development of AIDS activist networks in communities of color; to portrayals of autism in literature; and more. Others demonstrated a commitment to bringing back the skills they learn to expand TDM opportunities for students and communities who have traditionally been marginalized or under-resourced. They also came from a variety of institution types, from research advising and support experience, professional roles, levels of experience with TDM, career stages, and disciplinary perspectives.
We are also moved by the participants’ own reflections on the experience. One of the last interactive exercises we hosted during the online Institute was a collective week-in-review discussion, and gratitude wall. We asked the participants to share what they were thankful for, highlighting other participants where possible. So many of the participants wrote about how valuable the learning experience was and how thoughtfully it was put together and delivered.
We can’t express the transformational impact of the week better than the participants, themselves. In Institute evaluation forms, they shared feelings like:
- “This is by far the best organized event that I have ever attended. The content was by far the most substantive. The faculty were by far the most engaged. A+ across the board.”
- “I am so grateful to have had the opportunity to engage with a diverse group of scholars (researchers and professionals)… The deliberately thought through breakdown and mix fostered incredibly valuable discussions and I would hope this kind of framework is used as a best practice for future DH institutes of all kinds going forward. Also, thank you for such an amazing virtual experience which I can only imagine took a tremendous amount of work to coordinate and plan with limited time to shift to an entirely different format–I was overjoyed to critically engage with complex subjects…”
- “This has been phenomenal. I don’t want to qualify it (by adding something like “…for having to be moved online”), because it’s been so, so good: well organized, thoughtful, and human throughout.”
- “There was clearly so much thought, care, and planning that went into the preparation of this institute, and it was an amazing opportunity to learn from a group of people — organizers, faculty, and participants — who all have such deep expertise. The video and readings lists alone are a huge resource, but to be able to process and reflect on that material together with a diverse group of people was really wonderful.”
Next steps, and our own gratitude
What’s next for Building LLTDM? The “Institute” is not over yet; only the 1-week training is complete. The cohort will be meeting again virtually in February 2021 to discuss how implementation of the literacies into our local communities and practices has gone. In the meantime, as the participants bring back the law and policy literacies they’ve learned to their home institutions, we are excited to see several cohort members already organizing their own post-Institute research subgroups, such as those whose TDM work relies heavily on social media content, and others who are exploring how to disseminate the Building LLTDM literacies within other instructional formats and frameworks.
As part of the grant, the project team will also be aggregating the resources from the Institute and developing supplementary material for an Open Educational Resource (OER). We know there is a large community of TDM researchers and professionals who may be interested in or who can benefit from these materials, and the OER will be made available for broad reuse in the public domain.
Thank you to all the participants for their insights and contributions, willingness to share, and flexibility in transitioning to a fully-remote Institute. Thank you to all the faculty for their unmatched legal and policy expertise, ongoing commitment to mentorship, and adaptability in content creation and delivery. And thank you again to the NEH for making such a meaningful experience possible.
“A king is always a king –and a woman is always a woman: his authority and her sex ever stand between them and rational converse.” – Mary Wollstonecraft
Recent Berkeley graduate Julia Burke begins her essay, “Over Mary’s Dead Body: Frankenstein, Sexism & Socialism,” a historiography and cultural critique of Shelley’s Frankenstein, with the above epigraph from Mary Wollstonecraft, the great political philosopher and Mary Shelley’s mother. Burke’s research into the reception of Frankenstein and in its possible influence on socialist radicals of the 1840s earned her the prestigious 2018 Charlene Conrad Liebau Library Prize for Undergraduate Research, an annual prize awarded to students who have done exceptional research and made significant use of the Library’s resources.
Burke’s paper is the subject of this semester’s rotating Library Prize Exhibit, located on the second floor of Doe between the Heyns Reading Room and Reference Hall. Drawing on the Library and the Bancroft’s broad collections, the exhibit outlines Burke’s arguments in visual form with digitized replicas of the original 1818 edition of Frankenstein, an early copy of The Communist Manifesto, letters, contemporary reviews, and more. The exhibition of Burke’s project coincides with the bicentennial of Frankenstein’s publication. Originally published anonymously, Frankenstein’s true author was greatly contested, as Burke explores. Today it is one of the most important works of the literary canon and the most read novel in undergraduate courses nationwide. The exhibit was curated by Stacy Reardon, the Literature and Digital Humanities Librarian, and designed by Aisha Hamilton, the Exhibits and Environmental Graphics Coordinator. The exhibit will be up until April 2019.
The Charlene Conrad Liebau Library Prize for Undergraduate Research is awarded annually, and submissions are now open to all undergraduates until April 18, 2019. Any project from a credit course at U.C. Berkeley from Spring 2018 to Spring 2019 (lower division) or Summer 2018 to Spring 2019 (upper division) is eligible. The project can be in progress as of the due date of the application. In addition to a monetary award of $750 for lower-division winners and $1000 for upper-division winners, the recipients of the Library Prize publish their work in eScholarship, and two will be featured in an exhibit in the Library. Find out more information here.
You can see the rest of this year’s winners and honorable mentions here. Don’t forget to stop by the exhibit to see Burke’s work in person. More books related to Frankenstein in honor of the bicentennial can be found here.
This month, we’re bringing you a special edition of our From the Archives department. Below are interviews, all available in the OHC archives, recommended by each of us. Enjoy digging through the crates!
Martin Meeker’s pick:
Andre Tchelistcheff: Grapes, Wine, and Technology. Some lives in our collection of interviews are just profoundly interesting, and well worth digging into. This might be because of difficulties surmounted, achievements recognized, or simply the quality of the telling. Our 1979 oral history with Andre Tchelistcheff reveals one such life that ticks all of those boxes. From his birth in Russia in 1901, through his harrowing escape during the Revolution, to his years in France studying viticulture, and his decades quite literally remaking California’s wine industry, Tchelistcheff lived a remarkably influential life while remaining rooted in his passions throughout.
Roger Eardley-Pryor’s pick:
J. Michael McCloskey (Mike McCloskey), “Sierra Club Executive Director and Chairman, 1980s-1990s: A Perspective on Transitions in the Club and the Environmental Movement,” conducted in 1998 and published in 1999, is the second oral history with Mike McCloskey as part of the Sierra Club Oral History Project. Mike, a longtime leader in one of the largest environmental organizations in the United States, discusses the Club’s growing pains associated with an upsurge in membership amid Ronald Reagan’s anti-environmental actions in the early 1980s. Today, in lieu of modern assaults against environmental protections, Mike’s oral history sheds light on ways environmentalists managed those challenges and even expanded their purview to international issues.
Amanda Tewes pick:
Paul Burnett’s pick:
I choose nurse educator and clinical nurse Angie Lewis, who worked at UC San Francisco during the early years of the AIDS crisis. In Lewis’ interview, we really hear what it was like to first learn of this then-unknown disease that was killing gay people in San Francisco in the early 1980s. But we also hear touching stories of the mobilization of community and medical support for those who were suffering from AIDS.
David Dunham’s pick:
David Blackwell: African American Faculty and Senior Staff Oral History Project. Named after an esteemed mathematician and the first African-American tenured professor at Cal, David Blackwell Hall opened this fall to honor Professor Blackwell. Read more about his pioneering life in his oral history, part of our African American and Senior Faculty Oral History Project.
Todd Holmes’ pick:
I’d recommend Francis Mary Albrier: Determined Advocate for Racial Equality. This oral history captures the extraordinary life of one of Berkeley’s most prominent citizens, from her leading role in fighting discriminatory hiring in the City’s schools and businesses to desegregating the famed Richmond Shipyards. Moreover, through her oral history, you get a clear view of the many unsung citizens that organized communities of color to collectively push for change.
Shanna Farrell’s pick:
When I was first learning how to conduct longform interviews, I drew inspiration from Willa Baum, former director of the Oral History Center. She was an amazing interviewer, and her oral history interview provided insight into who she was, what drove her, and how she built the reputation of our office.
“Research software” presents a significant challenge for efforts aimed at ensuring reproducibility of scholarship. In a collaboration between the UC Berkeley Library and the California Digital Library, John Borghi and I (Yasmin AlNoamany) conducted a survey study examining practices and perceptions related to research software. Based on 215 participants, representing a variety of research disciplines, we presented the findings of asking researchers questions related to using, sharing, and valuing software. We addressed three main research questions: What are researchers doing with code? How do researchers share their code? What do researchers value about their code? The survey instrument consisted of 56 questions.
We are pleased to announce the publication of paper describing the results of our survey “Towards computational reproducibility: researcher perspectives on the use and sharing of software” in PeerJ Computer Science. Here are some interesting findings from our research:
- Results showed that software-related practices are often misaligned with those broadly related to reproducibility. In particular, while scholars often save their software for long periods of time, many do not actively preserve or maintain it. This perspective is perhaps best encapsulated by one of our participants who, when completing our open response question about the definition of sharing and preserving software, wrote ” ‘Sharing’ means making it publicly available on Github. ‘Preserving’ means leaving it on GitHub”.
- Only 50.51% of our participants were aware of software-related community standards in their field or discipline.
- Participants from computer scientists reported that they provide information about dependencies and comments in their source code more than those from other disciplines.
- Regarding to sharing software, we found that the majority of participants who do not share their code, they indicated that had privacy issues and time limitation to prepare code for sharing.
- Regarding to preservation, only a 20% of our participants reported that they save their software for eight years or more, 40% indicated that they do not prepare their software for long term preservation. The majority of participants (76.2%) indicated that they use Github for preserving software.
- The majority of our participants indicated that view code or software as “first class” research products that should be assessed, valued, and shared in the same way as a journal article. However, our results also indicate that there remains a significant gap between this perception and actual practice. As a result we encourage the community to work together for creating programs to train researchers early on how to maintain their code in the active phase of their research.
- Some of researchers’ perspectives on the usage of code/software:
“Software is the main driver of my research and development program. I use it for everything from exploratory data analysis, to writing papers…
- “I use code to document in a reproducible manner all steps of data analysis, from collecting data from where they are stored to preparing the final reports (i.e. a set of scripts can fully reproduce a report or manuscript given the raw data, with little human intervention).”
- Some of researchers’ perspectives on sharing and preservation:
- “I think of sharing code as making it publicly accessible, but not necessarily advertising it. I think of preserving code as depositing it somewhere remotely, where I can’t accidentally delete it. I realize that GitHub should not be the end goal of code preservation, but as of yet I have not taken steps to preserve my code anywhere more permanently than GitHub.”
- “…’Sharing’, to me, means that somebody else can discover and obtain the code, probably (but not necessarily) along with sufficient documentation to use it themselves. ‘Preserve’ has stronger connotations. It implies a higher degree of documentation, both about the software itself, but also its history, requirements, dependencies, etc., and also feels more “official”- so my university’s data repository feels more ‘preserve’-ish than my group’s Github page.”
For more details and in-depth discussion on the initial research, the paper is available and open access here: https://peerj.com/articles/cs-163/. All the other related files to this project can be found here: https://yasmina85.github.io/swcuration/
by Taylor Follett
Fall semester is always a time of fresh beginnings — new classes, new faces, and most excitingly for those of us at the library, access to new resources. We hope that the following new databases, books, journals, and much more will be of value to those studying literature. Here are some highlights for undergraduates, graduate students, and professors alike.
— Yasmina Anwar (@yasmina_anwar) February 13, 2018
Last week, the University Library, the Berkeley Institute for Data Science (BIDS), the Research Data Management program were delighted to host Love Data Week (LDW) 2018 at UC Berkeley. Love Data Week is a nationwide campaign designed to raise awareness about data visualization, management, sharing, and preservation. The theme of this year’s campaign was data stories to discuss how data is being used in meaningful ways to shape the world around us.
At UC Berkeley, we hosted a series of events designed to help researchers, data specialists, and librarians to better address and plan for research data needs. The events covered issues related to collecting, managing, publishing, and visualizing data. The audiences gained hands-on experience with using APIs, learned about resources that the campus provides for managing and publishing research data, and engaged in discussions around researchers’ data needs at different stages of their research process.
Participants from many campus groups (e.g., LBNL, CSS-IT) were eager to continue the stimulating conversation around data management. Check out the full program and information about the presented topics.
Photographs by Yasmin AlNoamany for the University Library and BIDS.
LDW at UC Berkeley was kicked off by a walkthrough and demos about Scopus APIs (Application Programming Interface), was led by Eric Livingston of the publishing company, Elsevier. Elsevier provides a set of APIs that allow users to access the content of journals and books published by Elsevier.
In the first part of the session, Eric provided a quick introduction to APIs and an overview about Elsevier APIs. He illustrated the purposes of different APIs that Elsevier provides such as DirectScience APIs, SciVal API, Engineering Village API, Embase APIs, and Scopus APIs. As mentioned by Eric, anyone can get free access to Elsevier APIs, and the content published by Elsevier under Open Access licenses is fully available. Eric explained that Scopus APIs allow users to access curated abstracts and citation data from all scholarly journals indexed by Scopus, Elsevier’s abstract and citation database. He detailed multiple popular Scopus APIs such as Search API, Abstract Retrieval API, Citation Count API, Citation Overview API, and Serial Title API. Eric also overviewed the amount of data that Scopus database holds.
In the second half of the workshop, Eric explained how Scopus APIs work, how to get a key to Scopus APIs, and showed different authentication methods. He walked the group through live queries, showed them how to extract data from API and how to debug queries using the advanced search. He talked about the limitations of the APIs and provided tips and tricks for working with Scopus APIs.
Eric left the attendances with actionable and workable code and scripts to pull and retrieve data from Scopus APIs.
On the second day, we hosted a Data Stories and Visualization Panel, featuring Claudia von Vacano (D-Lab), Garret S. Christensen (BIDS and BITSS), Orianna DeMasi (Computer Science and BIDS), and Rita Lucarelli (Department of Near Eastern Studies). The talks and discussions centered upon how data is being used in creative and compelling ways to tell stories, in addition to rewards and challenges of supporting groundbreaking research when the underlying research data is restricted.
Claudia von Vacano, the Director of D-Lab, discussed the Online Hate Index (OHI), a joint initiative of the Anti-Defamation League’s (ADL) Center for Technology and Society that uses crowd-sourcing and machine learning to develop scalable detection of the growing amount of hate speech within social media. In its recently-completed initial phase, the project focused on training a model based on an unbiased dataset collected from Reddit. Claudia explained the process, from identifying the problem, defining hate speech, and establishing rules for human coding, through building, training, and deploying the machine learning model. Going forward, the project team plans to improve the accuracy of the model and extend it to include other social media platforms.
Next, Garret S. Christensen, BIDS and BITSS fellow, talked about his experience with research data. He started by providing a background about his research, then discussed the challenges he faced in collecting his research data. The main research questions that Garret investigated are: How are people responding to military deaths? Do large numbers of, or high-profile, deaths affect people’s decision to enlist in the military?
Garret discussed the challenges of obtaining and working with the Department of Defense data obtained through a Freedom of Information Act request for the purpose of researching war deaths and military recruitment. Despite all the challenges that Garret faced and the time he spent on getting the data, he succeeded in putting the data together into a public repository. Now the information on deaths in the US Military from January 1, 1990 to November 11, 2010 that was obtained through Freedom of Information Act request is available on dataverse. At the end, Garret showed that how deaths and recruits have a negative relationship.
Orianna DeMasi, a graduate student of Computer Science and BIDS Fellow, shared her story of working with human subjects data. The focus of Orianna’s research is on building tools to improve mental healthcare. Orianna framed her story about collecting and working with human subject data as a fairy tale story. She indicated that working with human data makes security and privacy essential. She has learned that it’s easy to get blocked “waiting for data” rather than advancing the project in parallel to collecting or accessing data. At the end, Orianna advised the attendees that “we need to keep our eyes on the big problems and data is only the start.”
Rita Lucarelli, Department of Near Eastern Studies discussed the Book of the Dead in 3D project, which shows how photogrammetry can help visualization and study of different sets of data within their own physical context. According to Rita, the “Book of the Dead in 3D” project aims in particular to create a database of “annotated” models of the ancient Egyptian coffins of the Hearst Museum, which is radically changing the scholarly approach and study of these inscribed objects, at the same time posing a challenge in relation to data sharing and the publication of the artifacts. Rita indicated that metadata is growing and digital data and digitization are challenging.
It was fascinating to hear about Egyptology and how to visualize 3D ancient objects!
We closed out LDW 2018 at UC Berkeley with a session about Research Data Management Planning and Publishing. In the session, Daniella Lowenberg (University of California Curation Center) started by discussing the reasons to manage, publish, and share research data on both practical and theoretical levels.
Daniella shared practical tips about why, where, and how to manage research data and prepare it for publishing. She discussed relevant data repositories that UC Berkeley and other entities offer. Daniela also illustrated how to make data reusable, and highlighted the importance of citing research data and how this maximizes the benefit of research.
At the end, Daniella presented a live demo on using Dash for publishing research data and encouraged UC Berkeley workshop participants to contact her with any question about data publishing. In a lively debate, researchers shared their experiences with Daniella about working with managing research data and highlighted what has worked and what has proved difficult.
We have received overwhelmingly positive feedback from the attendees. Attendees also expressed their interest in having similar workshops to understand the broader perspectives and skills needed to help researchers manage their data.
I would like to thank BIDS and the University Library for sponsoring the events.
The University Library, Research IT, and Berkeley Institute for Data Science will host a series of events on February 12th-16th during the Love Data Week 2018. Love Data Week a nationwide campaign designed to raise awareness about data visualization, management, sharing, and preservation.
Please join us to learn about multiple data services that the campus provides and discover options for managing and publishing your data. Graduate students, researchers, librarians and data specialists are invited to attend these events to gain hands-on experience, learn about resources, and engage in discussion around researchers’ data needs at different stages in their research process.
To register for these events and find out more, please visit: http://guides.lib.berkeley.edu/ldw2018guide
Intro to Scopus APIs – Learn about working with APIs and how to use the Scopus APIs for text mining.
01:00 – 03:00 p.m., Tuesday, February 13, Doe Library, Room 190 (BIDS)
Refreshments will be provided.
Data stories and Visualization Panel – Learn how data is being used in creative and compelling ways to tell stories. Researchers across disciplines will talk about their successes and failures in dealing with data.
1:00 – 02:45 p.m., Wednesday, February 14, Doe Library, Room 190 (BIDS)
Refreshments will be provided.
Planning for & Publishing your Research Data – Learn why and how to manage and publish your research data as well as how to prepare a data management plan for your research project.
02:00 – 03:00 p.m., Thursday, February 15, Doe Library, Room 190 (BIDS)
Hope to see you there!
— Yasmina Anwar (@yasmina_anwar) May 2, 2017
On May 2 – 5 2017, I (Yasmin AlNoamany) was thrilled to attend the csv,conf,v3 2017 conference and the Software/Data Carpentry instructor training in Portland, Oregon, USA. It was a unique experience to attend and speak with many people who are passionate about data and open science.
— Yasmina Anwar (@yasmina_anwar) May 2, 2017
— Yasmina Anwar (@yasmina_anwar) May 2, 2017
— Yasmina Anwar (@yasmina_anwar) May 2, 2017
— V Ikeshoji-Orlati (@vikeshojiorlati) May 3, 2017
I presented on the second day about generating stories from archived data, which entitled “Using Web Archives to Enrich the Live Web Experience Through Storytelling”. Check out the slides of my talk below.
I demonstrated the steps of the proposed framework, the Dark and Stormy Archives (DSA), in which, we identify, evaluate, and select candidate Web pages from archived collections that summarize the holdings of these collections, arrange them in chronological order, and then visualize these pages using tools that users already are familiar with, such as Storify. For more information about this work, check out this post.
— Hamdan Azhar (@HamdanAzhar) May 3, 2017
The csv,conf deserved to won the conference of the year prize for bringing the CommaLlama. The Alpaca brought much joy and happiness to all conference attendees. It was fascinating to be in csv,conf 2017 to meet and hear from passionate people from everywhere about data.
— Yasmina Anwar (@yasmina_anwar) May 3, 2017
— Yasmina Anwar (@yasmina_anwar) May 3, 2017
— Yasmina Anwar (@yasmina_anwar) May 3, 2017
— Yasmina Anwar (@yasmina_anwar) May 3, 2017
— Yasmina Anwar (@yasmina_anwar) May 4, 2017
After the conference, Max Odgen from the Dat Data Project gave us a great tour from the conference venue to South Portland. We had a great food from street food trucks at Portland, then we had a great time with adorable neighborhood cats!
— Yasmina Anwar (@yasmina_anwar) May 4, 2017
Tha Carpentry Training
— Library Carpentry (@LibCarpentry) May 5, 2017
— Library Carpentry (@LibCarpentry) May 5, 2017
The two days had a mix of lectures and hands-on exercises about learning philosophy and Carpentry teaching practices. It was a unique and fascinating experience to have. We had two energetic instructors, Tim Dennis and Belinda Weaver, who generated welcoming and collaborate environment for us. Check out the full schedule and lessons from here.
— Elaine Wong (@elthenerd) May 5, 2017
At the end, I would like to acknowledge the support I had from the California Digital Library and the committee of the csv,conf for giving me this amazing opportunity to attend and speak at the csv,conf and the Carpentry instructor training. I am looking forward to applying what I learned in upcoming Carpentry workshops at UC Berkeley.
— Yasmina Anwar (@yasmina_anwar) February 14, 2017
This week, the University Library and the Research Data Management program were delighted to participate in the Love Your Data (LYD) Week campaign by hosting a series of workshops designed to help researchers, data specialists, and librarians to better address and plan for research data needs. The workshops covered issues related to managing, securing, publishing, and licensing data. Participants from many campus groups (e.g., LBNL, CSS-IT) were eager to continue the stimulating conversation around data management. Check out the full program and information about the presented topics.
Photographs by Yasmin AlNoamany for the University Library.
The first day of LYD week at UC Berkeley was kicked off by a discussion panel about Securing Research Data, featuring
Jon Stiles (D-Lab, Federal Statistical RDC), Jesse Rothstein (Public Policy and Economics, IRLE), Carl Mason (Demography). The discussion centered upon the rewards and challenges of supporting groundbreaking research when the underlying research data is sensitive or restricted. In a lively debate, various social science researchers detailed their experiences working with sensitive research data and highlighted what has worked and what has proved difficult.
At the end, Chris Hoffman, the Program Director of the Research Data Management program, described a campus-wide project about Securing Research Data. Hoffman said the goals of the project are to improve guidance for researchers, benchmark other institutions’ services, and assess the demand and make recommendations to campus. Hoffman asked the attendees for their input about the services that the campus provides.
On the second day, we hosted a workshop about the best practices for using Box and bDrive to manage documents, files and other digital assets by Rick Jaffe (Research IT) and Anna Sackmann (UC Berkeley Library). The workshop covered multiple issues about using Box and bDrive such as the key characteristics, and personal and collaborative use features and tools (including control permissions, special purpose accounts, pushing and retrieving files, and more). The workshop also covered the difference between the commercial and campus (enterprise) versions of Box and Drive. Check out the RDM Tools and Tips: Box and Drive presentation.
We closed out LYD Week 2017 at UC Berkeley with a workshop about Research Data Publishing and Licensing 101. In the workshop, Anna Sackmann and Rachael Samberg (UC Berkeley’s Scholarly Communication Officer) shared practical tips about why, where, and how to publish and license your research data. (You can also read Samberg & Sackmann’s related blog post about research data publishing and licensing.)
In the first part of the workshop, Anna Sackmann talked about reasons to publish and share research data on both practical and theoretical levels. She discussed relevant data repositories that UC Berkeley and other entities offer, and provided criteria for selecting a repository. Check out Anna Sackmann’s presentation about Data Publishing.
During the second part of the presentation, Rachael Samberg illustrated the importance of licensing data for reuse and how the agreements researchers enter into and copyright affects licensing rights and choices. She also distinguished between data attribution and licensing. Samberg mentioned that data licensing helps resolve ambiguity about permissions to use data sets and incentivizes others to reuse and cite data. At the end, Samberg explained how people can license their data and advised UC Berkeley workshop participants to contact her with any questions about data licensing.
Check out the slides from Rachael Samberg’s presentation about data licensing below.
The workshops received positive feedback from the attendees. Attendees also expressed their interest in having similar workshops to understand the broader perspectives and skills needed to help researchers manage their data.
Special thanks to Rachael Samberg for editing this post.