Wikiphiliacs, Unite! (At our Wikipedia Editathon, on Valentine’s Day, 2024)

"Edit for Change" Wikipedia Editathon date and time details, with a graphic looking like a crossword puzzle

I am a proud Wikiphiliac.  At least, according to the Urban Dictionary, which defines Wikiphilia as “a powerful obsession with Wikipedia”. I have many of the signs it warns of, including “accessing Wikipedia several times a day…spending much more time on Wikipedia than originally intended [and]… compulsively switching to other Wikipedia articles, using the hyperlinks within articles, often without obtaining the originally sought information and leaving a bizarre informational “trail” in his/her browsing history” (but that last part is just normal life as a librarian).

How else do I love Wikipedia?  Let me count the ways!  As a librarian, I always approach crowd-sourced information with a critical eye, but I also admire that Wikipedia has its own standards for fact-checking, and in fact some topics are locked to public editing.  It takes its mission very seriously.  It also has an accessible and neutral tone.  Especially when I want to learn about a technical topic, it can give me a straightforward and helpful way to approach it.  I also use it pretty routinely as a way to look at collections of sources about a topic; when I was a medical librarian, I was asked for data on the condition neurofibromatosis, and at that time the best basic links I found were in the references for the Wikipedia article.   Last and maybe most importantly, the fact that anyone can edit is a huge strength…with challenges.  Wikipedia openly admits its content is skewed by the gender and racial imbalance of its editors, and knowing this is part of approaching it critically, but it also means that IT CAN CHANGE, and WE CAN CHANGE IT.

Given that philia, a word taken from Ancient Greek (according to the philia Wikipedia article), means affection for or love of something, it’s fitting that our 2024 Wikipedia Editathon is part of UC’s Love Data Week, and happens on Valentine’s Day.   If you would like to learn to contribute to this amazing resource, and perhaps even help diversify its editorial pool, we can get you started!  There isn’t yet a Wikipedia page on Wikiphilia, but maybe you could create one!  There already is a podcast series

If you’re interested in learning more, we warmly welcome you and invite you to join us on Wednesday, February 14, from 1-2:30 for the 2024 UC Berkeley Libraries Wikipedia Editathon.  No experience is required—we will teach you all you need to know about editing!  (but, if you want to edit with us in real time, please create a Wikipedia account before the workshop—information on how to do that is on the registration page).  The link to register is here, and you can contact any of the workshop leaders with questions.  We hope you will join us, and we look forward to editing with you!

NOTE: the Wikipedia Editathon is just one of the programs that’s part of the University of California’s Love Data Week 2024!  Don’t forget to check out all the other great UC Love Data Week offerings—this year UC Berkeley Librarians are hosting/co-hosting SIX different sessions!  Here are those UCB-led workshop links, and the full calendar is linked here:

Thinking About and Finding Health Statistics & Data
GIS & Mapping: Where to Start
Cultivating Collaboration: Getting Started with Open Research
Code-free Data Analysis
Wikipedia Edit-a-thon 
Getting Started with Qualitative Data Analysis

Love Data Week calendar, with Berkeley-led offerings circled


Come Help Us Create Wikipedia and Create Change, Edit by Edit, on February 15, 2023!

Screenshot of Wikipedia Entry for the Movie Tár 1-20-23
Screenshot of Wikipedia Entry for the Movie Tár 1-20-23

Wikipedia has become so central to our lives that we count on it to represent reality, and solid fact. When we encounter a new phenomenon, we check out our trusty online friend for more information.  So, it was fascinating to me recently to see the lines blur between fiction and reality, when Wikipedia was used as a visual and social cue in the movie Tár, starring Cate Blanchett, about a famed female conductor.  In the movie, one of the clues to the coming turbulence in Lydia Tár’s life is a screen capture of a mystery editor changing items on the conductor’s Wikipedia entry. It looked and felt so real, the filming and Blanchett’s performance so rivetingly vivid, that many people believed the film was a biopic of a real person.   As Brooke LaMantia wrote in her article, No, Lydia Tar is Not Real,

“When I left the theater after watching Tár for two hours and 38 minutesI immediately fumbled for my phone. I couldn’t wait to see actual footage of the story I had just seen and was so ready for my Wikipedia deep dive to sate me during my ride home. But when I frantically typed “Lydia Tar?” into Google as I waited for my train, I was greeted with a confusing and upsetting realization: Lydia Tár is not real…the film’s description on Letterboxd — “set in the international world of classical music, centers on Lydia Tár, widely considered one of the greatest living composer/conductors and first-ever female chief conductor of a major German orchestra” — is enough to make you believe Tár is based on a true story. The description was later added to a Wikipedia page dedicated to “Lydia Tár,” but ahead of the film’s October 28 wide release, that page has now been placed under a broader page for the movie as a whole. Was this some sort of marketing sleight of hand or just a mistake I stumbled upon? Am I the only one who noticed this? I couldn’t be, right? I thought other people had to be stuck in that same cycle of questioning: Wait, this has to be real. Or is it? She’s not a real person?

Wikipedia is central to LaMantia’s questioning!  While it’s easy to understand people’s confusion in general, the Tár Wikipedia page, created by editors like you and like me, is very clear that this is a film, at least as of today’s access date, January 20, 2023… On the other hand, did you know you can click on the “View History” link on the page, and see every edit that has been made to it, since it was created, and who made that edit?  If you look at the page resulting from one of the edits from October 27, 2022, you can see that it does look like Tár is a real person, and in fact, a person who later went on to edit this entry to make it clearer wrote, “Reading as it was, it is not clear if Lydia actually exists.”  Maybe I should write to LaMantia and let her know.

I tell this story to show that clearly, Wikipedia is a phenomenon, and a globally central one, which makes it all the more amazing that it is created continuously, edit by edit, editor by editor.  There are many ways in which our own and your own edits can create change, lead to social justice, correct misinformation and more.  While it’s easy to get lost in the weeds of minute changes to esoteric entries, it’s also possible to improve pages on important figures in real-life history and bring them into our modern narrative and consciousness.  And it’s easy to do!

If you are interested in learning more, and being part of this central resource, we warmly welcome you and invite you to join us on Wednesday, February 15, from 1-2:30 for our 2023 Wikipedia Editathon, part of the University of Calif0rnia-wide 2023 Love Data Week.  No experience is required—we will teach you all you need to know about editing!  (but, if you want to edit with us in real time, please create a Wikipedia account before the workshop).  The link to register is here, and you can contact any of the workshop leaders (listed on the registration page) with questions.  We look forward to editing with you!


Coming Soon: Love Your Data, from Editathons to Containers!

UC Berkeley has been loving its data for a long time, and has been part of the international movement which is Love Data Week (LDW) since at least 2016, even during the pandemic!  This year is no exception—the UC Berkeley Libraries and our campus partners are offering some fantastic workshops (four of which are led by our very own librarians) as part of the University of California-wide observance.

Love Data Week 2023 is happening next month, February 13-17 (it’s always during the week of Valentine’s Day)!

University of California 2023 Love Data Week calendar with UC Berkeley offerings

UC Berkeley Love Data Week offerings for 2023 include:

GIS & Mapping: Where to Start

Wikipedia Edit-a-thon (you can also dip into Wikidata at other LDW events)

Introduction to Containers

Textual Analysis with Archival Materials

Getting Started with Qualitative Data Analysis

All members of the UC community are welcome—we hope you will join us!  Registration links for our offerings are above, and the full UC-wide calendar is here.   If you are interested in learning more about what the library is doing with data, check out our new Data + Digital Scholarship Services page.  And, feel free to email us at librarydataservices@berkeley.edu.   Looking forward to data bonding next month!


Love data? Join us for Love Data Week 2022, Feb. 14-18!

Once again, UC Libraries are collaborating on a UC-wide Love Data Week series of talks, presentations, and workshops Feb. 14-18, 2022. With over 30 presentations and workshops, there’s plenty to choose from, with topics such as:

  • How to write effective data management plans
  • Text analysis with Python
  • How and where to share your research data
  • Geospatial analysis with R and with Jupyter Notebooks
  • Data ethics & justice
  • Cleaning and coding data for qualitative analysis
  • Software management for researchers
  • An introduction to databases for newspapers and social science data
  • 3-D data, visualization, and mapping

All members of the UC community are invited to attend these events to gain hands-on experience, learn about resources, and engage in discussions about data needs throughout the research process. To register for workshops during this week and see what other sessions will be offered UC-wide, visit the UC Love Data Week 2022 website.


Love Data? Join Us During Love Data Week 2021, Feb 8-12!

Love Data Week 2021

Since our Love Data Week invitation post last year, the COVID pandemic has created a new world— and amazing new opportunities and challenges related to data.  Just a peek at data.berkeley.edu (the portal for Berkeley’s Computing, Data Science, and Society Division) shows that data-related research during this past pandemic year, even with its intense and difficult challenges, has revealed new insights.  Check out “Pandemic provides real-time experiment for diagnosing, treating misinformation, disinformation”.*  

So, it’s fitting that Love Data Week 2021 at Berkeley, hosted by the UC Berkeley Library in partnership with Berkeley’s Research IT department, is focused on the kinds of issues we are confronted with in a wholly-online research environment.  Join us on Tuesday for a session on ethical considerations in data, most definitely a concern with many of Berkeley’s researchers looking at issues related to COVID; on Wednesday for a talk on cybersecurity (aimed at graduate researchers but all are welcome); on Thursday for another security-related workshop, “Getting Started with LastPass & Veracrypt”; and on Friday for an introduction to Savio, Berkeley’s high performance computing cluster.  Please click on this link for information on these, and registration links!

Questions?  E-mail LDW 2021 at researchdata@berkeley.edu .  And, if we’ve whetted your appetite for data and more data, take a look at the University of California-wide Love Data Week offerings.  If you’ve ever wondered what an API is, or want a quick intro to SQL, or even just want to know what the acronyms stand for, there are these sessions and more!

*  The same page makes it clear that data is for everyone; check out “I Am a Data Scientist”, about a student who came to Berkeley as an English major and discovered how data can “shed light on larger-scale questions”, and “Translating Numbers Into Words: The Art of Writing About Data Science”, featuring three Berkeleyites who are getting the word out about data.

 


UC Berkeley celebrates Love Data Week with great talks and tips!

Last week, the University Library, the Berkeley Institute for Data Science (BIDS), the Research Data Management program were delighted to host Love Data Week (LDW) 2018 at UC Berkeley. Love Data Week is a nationwide campaign designed to raise awareness about data visualization, management, sharing, and preservation. The theme of this year’s campaign was data stories to discuss how data is being used in meaningful ways to shape the world around us.

At UC Berkeley, we hosted a series of events designed to help researchers, data specialists, and librarians to better address and plan for research data needs. The events covered issues related to collecting, managing, publishing, and visualizing data. The audiences gained hands-on experience with using APIs, learned about resources that the campus provides for managing and publishing research data, and engaged in discussions around researchers’ data needs at different stages of their research process.

Participants from many campus groups (e.g., LBNL, CSS-IT) were eager to continue the stimulating conversation around data management. Check out the full program and information about the presented topics.

Photographs by Yasmin AlNoamany for the University Library and BIDS.

Eric Livingston explains the difference between Elsevier APIs.

LDW at UC Berkeley was kicked off by a walkthrough and demos about Scopus APIs (Application Programming Interface), was led by Eric Livingston of the publishing company, Elsevier. Elsevier provides a set of APIs that allow users to access the content of journals and books published by Elsevier.

In the first part of the session, Eric provided a quick introduction to APIs and an overview about Elsevier APIs. He illustrated the purposes of different APIs that Elsevier provides such as DirectScience APIs, SciVal API, Engineering Village API, Embase APIs, and Scopus APIs. As mentioned by Eric, anyone can get free access to Elsevier APIs, and the content published by Elsevier under Open Access licenses is fully available. Eric explained that Scopus APIs allow users to access curated abstracts and citation data from all scholarly journals indexed by Scopus, Elsevier’s abstract and citation database. He detailed multiple popular Scopus APIs such as Search API, Abstract Retrieval API, Citation Count API, Citation Overview API, and Serial Title API. Eric also overviewed the amount of data that Scopus database holds.

The attendees conduct live queries on Scopus APIs.

In the second half of the workshop, Eric explained how Scopus APIs work, how to get a key to Scopus APIs, and showed different authentication methods. He walked the group through live queries, showed them how to extract data from API and how to debug queries using the advanced search. He talked about the limitations of the APIs and provided tips and tricks for working with Scopus APIs.

 

Eric explains code snippets for querying Scopus APIs.

Eric left the attendances with actionable and workable code and scripts to pull and retrieve data from Scopus APIs.

The Data Stories and Visualization Panel.

On the second day, we hosted a Data Stories and Visualization Panel, featuring Claudia von Vacano (D-Lab), Garret S. Christensen (BIDS and BITSS), Orianna DeMasi (Computer Science and BIDS), and Rita Lucarelli (Department of Near Eastern Studies). The talks and discussions centered upon how data is being used in creative and compelling ways to tell stories, in addition to rewards and challenges of supporting groundbreaking research when the underlying research data is restricted.

Claudia von Vacano talks about the Online Hate Index project.

Claudia von Vacano, the Director of D-Lab, discussed the Online Hate Index (OHI), a joint initiative of the Anti-Defamation League’s (ADL) Center for Technology and Society that uses crowd-sourcing and machine learning to develop scalable detection of the growing amount of hate speech within social media. In its recently-completed initial phase, the project focused on training a model based on an unbiased dataset collected from Reddit. Claudia explained the process, from identifying the problem, defining hate speech, and establishing rules for human coding, through building, training, and deploying the machine learning model. Going forward, the project team plans to improve the accuracy of the model and extend it to include other social media platforms.

Garret S. Christensen talks about his experience with research data.

Next, Garret S. Christensen, BIDS and BITSS fellow, talked about his experience with research data. He started by providing a background about his research, then discussed the challenges he faced in collecting his research data. The main research questions that Garret investigated are: How are people responding to military deaths? Do large numbers of, or high-profile, deaths affect people’s decision to enlist in the military?

Garret shows the relation between US deaths rate and the number of applicants to the military.

Garret discussed the challenges of obtaining and working with the Department of Defense data obtained through a Freedom of Information Act request for the purpose of researching war deaths and military recruitment. Despite all the challenges that Garret faced and the time he spent on getting the data, he succeeded in putting the data together into a public repository. Now the information on deaths in the US Military from January 1, 1990 to November 11, 2010 that was obtained through Freedom of Information Act request is available on dataverse. At the end, Garret showed that how deaths and recruits have a negative relationship.

Orianna DeMasi related her experience of working on real problems with real human data to taming dragons.

Orianna DeMasi, a graduate student of Computer Science and BIDS Fellow, shared her story of working with human subjects data. The focus of Orianna’s research is on building tools to improve mental healthcare. Orianna framed her story about collecting and working with human subject data as a fairy tale story. She indicated that working with human data makes security and privacy essential. She has learned that it’s easy to get blocked “waiting for data” rather than advancing the project in parallel to collecting or accessing data. At the end, Orianna advised the attendees that “we need to keep our eyes on the big problems and data is only the start.”

Rita Lucarelli is discussing the Book of the Dead in 3D.

Rita Lucarelli, Department of Near Eastern Studies discussed the Book of the Dead in 3D project, which shows how photogrammetry can help visualization and study of different sets of data within their own physical context. According to Rita, the “Book of the Dead in 3D” project aims in particular to create a database of “annotated” models of the ancient Egyptian coffins of the Hearst Museum, which is radically changing the scholarly approach and study of these inscribed objects, at the same time posing a challenge in relation to data sharing and the publication of the artifacts. Rita indicated that metadata is growing and digital data and digitization are challenging.

It was fascinating to hear about Egyptology and how to visualize 3D ancient objects!

Daniella Lowenberg presents on Research Data Management Planning and Publishing.

We closed out LDW 2018 at UC Berkeley with a session about Research Data Management Planning and Publishing. In the session, Daniella Lowenberg (University of California Curation Center) started by discussing the reasons to manage, publish, and share research data on both practical and theoretical levels.

Daniella shares tips about publishing research data.

Daniella shared practical tips about why, where, and how to manage research data and prepare it for publishing. She discussed relevant data repositories that UC Berkeley and other entities offer. Daniela also illustrated how to make data reusable, and highlighted the importance of citing research data and how this maximizes the benefit of research.

Daniella’s live demo on using Dash for publishing research data.

At the end, Daniella presented a live demo on using Dash for publishing research data and encouraged UC Berkeley workshop participants to contact her with any question about data publishing. In a lively debate, researchers shared their experiences with Daniella about working with managing research data and highlighted what has worked and what has proved difficult.

We have received overwhelmingly positive feedback from the attendees. Attendees also expressed their interest in having similar workshops to understand the broader perspectives and skills needed to help researchers manage their data.

I would like to thank BIDS and the University Library for sponsoring the events.

Yasmin AlNoamany

 


Love Data Week 2018 at UC Berkeley

The University Library,  Research IT,  and Berkeley Institute for Data Science will host a series of events on February 12th-16th during the Love Data Week 2018. Love Data Week a nationwide campaign designed to raise awareness about data visualization, management, sharing, and preservation.
Please join us to learn about multiple data services that the campus provides and discover options for managing and publishing your data. Graduate students, researchers, librarians and data specialists are invited to attend these events to gain hands-on experience, learn about resources, and engage in discussion around researchers’ data needs at different stages in their research process.
To register for these events and find out more, please visit: http://guides.lib.berkeley.edu/ldw2018guide

Schedule:
Intro to Scopus APIs –  Learn about working with APIs and how to use the Scopus APIs for text mining.
01:00 – 03:00 p.m., Tuesday, February 13, Doe Library, Room 190 (BIDS)
Refreshments will be provided.

Data stories and Visualization Panel – Learn how data is being used in creative and compelling ways to tell stories. Researchers across disciplines will talk about their successes and failures in dealing with data.
1:00 – 02:45 p.m., Wednesday, February 14, Doe Library, Room 190 (BIDS)
Refreshments will be provided.

Planning for & Publishing your Research Data – Learn why and how to manage and publish your research data as well as how to prepare a data management plan for your research project.
02:00 – 03:00 p.m., Thursday, February 15, Doe Library, Room 190 (BIDS)

Hope to see you there!

–Yasmin