<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom">
  <channel>
    <title>Data Culture Group @ Northeastern University</title>
    <description>Interrogating our datafied society</description>
    <link>https://dataculturegroup.org/</link>
    <atom:link href="https://dataculturegroup.org/feed.xml" rel="self" type="application/rss+xml"/>
    <pubDate>Wed, 11 Feb 2026 09:46:13 +0000</pubDate>
    <lastBuildDate>Wed, 11 Feb 2026 09:46:13 +0000</lastBuildDate>
    <generator>Jekyll v3.10.0</generator>
    
      <item>
        <title>Data Journalism in Dark Times: notes from the C+J keynotes</title>
        <description>&lt;p&gt;I just returned from the &lt;a href=&quot;https://cplusj2025.com&quot;&gt;Computation + Journalism Symposium in Miami&lt;/a&gt;, and I’m still processing the weight of what I heard. The four keynotes this year painted a sobering picture of data journalism operating under pressure—from authoritarian governments, algorithmic complexity, and information warfare. But they also offered glimpses of resilience and innovation that feel essential right now. Here are short summaries of each keynote, drafted with some AI assistance based on my live notes.&lt;/p&gt;

&lt;p&gt;&lt;img src=&quot;/static/img/posts/cplusj-25-selfie.jpg&quot; alt=&quot;selfie of my standing outside in front of a building with signage saying &amp;quot;University of Miami&amp;quot;&quot; /&gt;
&lt;em&gt;Me ready for Miami in December ☀️&lt;/em&gt;&lt;/p&gt;

&lt;h2 id=&quot;when-data-journalism-isnt-enough&quot;&gt;When Data Journalism Isn’t Enough&lt;/h2&gt;

&lt;p&gt;&lt;a href=&quot;https://attilabatorfy.com&quot;&gt;Attila Bátorfy&lt;/a&gt; opened with 15 years of experience doing data journalism in Hungary under Viktor Orbán’s increasingly authoritarian government. His message was blunt: sophisticated data journalism hasn’t stopped democratic erosion. Despite producing groundbreaking investigations–including tracking private jets to predict and photograph secret meetings of oligarchs–the work had limited impact on political reality.&lt;/p&gt;

&lt;p&gt;What struck me most was his catalog of how governments actively undermine data journalism: releasing low-resolution data, commodifying public information, creating datasets with bad methodology, and even sponsoring “independent” counter-data. His COVID dashboard became a trusted source (5 million uniques in 2 years, cited in dozens of scientific journals) precisely because government data was so unreliable. But he’s now moving away from data journalism, noting that audiences “don’t want facts” and that Hungarian newsrooms see it as expensive with bad ROI. The one exception? Sports and health data still cut through the noise. Attila’s upcoming work focuses on historical representatos of data, and I’m excited to learn from what he shares.&lt;/p&gt;

&lt;h2 id=&quot;data-as-weapon-and-shield&quot;&gt;Data as Weapon and Shield&lt;/h2&gt;

&lt;p&gt;&lt;a href=&quot;https://jsk.stanford.edu/people/kae-petrin&quot;&gt;Kae Petrin’s&lt;/a&gt; talk on anti-LGBTQ+ data policies brought this theme of governmental manipulation home to the US context. The Trump administration’s systematic removal and alteration of CDC data—changing “gender” to “sex,” taking down youth risk behavior data, adding warnings that datasets are “threats to society”—isn’t just censorship. It’s erasure through datafication.&lt;/p&gt;

&lt;p&gt;Petrin highlighted a crucial methodological problem that journalists fumbled: when the CDC improved its estimation methods for trans youth populations, the numbers roughly doubled. News coverage reported a “sharp rise” in trans youth, playing into “social contagion” narratives, when the real story was that better measurement revealing what was always there. Meanwhile, the demographic data showing far more trans young people than older adults raises haunting questions about where trans adults over 24 are. These are questions we can’t answer if the data disappears.&lt;/p&gt;

&lt;p&gt;But Petrin also showed data’s dark side: data can make people legible and vulnerable.&lt;/p&gt;

&lt;h2 id=&quot;user-agency-vs-algorithmic-power&quot;&gt;User Agency vs. Algorithmic Power&lt;/h2&gt;

&lt;p&gt;&lt;a href=&quot;https://comm.ucla.edu/person/homa-hosseinmardi/&quot;&gt;Homa Hosseinmardi’s&lt;/a&gt; work offered a different kind of complexity, and reveals that the villain might not be who we think. Her research on YouTube radicalization used an innovative “counterfactual bots” method to separate user intention from algorithmic influence. The finding? Users who relied exclusively on YouTube’s recommender actually consumed less partisan content than those who actively searched and navigated themselves.&lt;/p&gt;

&lt;p&gt;This doesn’t absolve platforms of responsibility, but it challenges the narrative of algorithms as “great radicalizers.” The real picture is messier: a small group of users consuming vast amounts of far-right content, driven largely by external links and their own preferences. When her team simulated users without agency (random video selection, first recommendation only, etc.), those bots were showng content that drifted towards the politcal center over time. The algorithm appears to reflect and amplify existing preferences rather than creating them from scratch.&lt;/p&gt;

&lt;p&gt;The implications for journalism are significant: we may be overexamining algorithms while overlooking user agency and the broader information ecosystem that drives people to platforms in the first place.&lt;/p&gt;

&lt;h2 id=&quot;navigating-narrative-warfare&quot;&gt;Navigating Narrative Warfare&lt;/h2&gt;

&lt;p&gt;Yevheniia Drozdova closed with work from &lt;a href=&quot;https://texty.org.ua&quot;&gt;texty.org.ua&lt;/a&gt;, showing what data journalism looks like when your country is literally at war. Working from Kyiv, her team has moved beyond fact-checking individual claims to mapping entire narrative ecosystems. They train models to identify manipulation techniques, track coordinated bot networks across Telegram (where 50%+ of Ukrainians get news), and detect patterns in TikTok campaigns using shared filters and timing signatures.&lt;/p&gt;

&lt;p&gt;Her team found 2,000 TikTok bot accounts and documented coordinated Telegram networks. They trained models to detect emotional manipulation techniques, finding that 90% of messages from Russian accounts use fear and doubt. But their work isn’t just technical detection, it’s about providing context before narratives go viral. As she put it: “Fast news only satisfies the hunger for information, while slow journalism addresses a different deficit—deficit of understanding.”&lt;/p&gt;

&lt;h2 id=&quot;what-stays-with-me&quot;&gt;What Stays With Me&lt;/h2&gt;

&lt;p&gt;These four talks trace a dark arc: data journalism struggling against authoritarian data practices, vulnerable populations made more visible and endangered through datafication, algorithmic explanations that may miss human complexity, and full-scale information warfare requiring new methodological approaches.&lt;/p&gt;

&lt;p&gt;But I’m also struck by the practitioners’ persistence. Bátorfy’s COVID dashboard becoming a trusted institution. Petrin tracking articles through &lt;a href=&quot;https://transnewsinitiative.org&quot;&gt;the Trans News Initiative&lt;/a&gt; to understand coverage patterns. Hosseinmardi’s methodological innovation in creating counterfactual bots. Drozdova’s team prebunking narratives in real-time during a war.&lt;/p&gt;

&lt;p&gt;These keynotes show various ways data journalism remains essential for documentation, for providing sources of truth, for understanding complex systems, and for serving communities when governments fail them. C+J’25 offered a critical moment of trans-national solidarity that I won’t soon forget.&lt;/p&gt;
</description>
        <pubDate>Sun, 14 Dec 2025 15:00:00 +0000</pubDate>
        <link>https://dataculturegroup.org/2025/12/14/cpluj-2025.html</link>
        <guid isPermaLink="true">https://dataculturegroup.org/2025/12/14/cpluj-2025.html</guid>
        
        
        <category>journalism</category>
        
        <category>events</category>
        
        <category>visualization</category>
        
      </item>
    
      <item>
        <title>Notes from MISI Summit on Building Trust in Climate and Science Journalism</title>
        <description>&lt;p&gt;I attended the inaugural &lt;a href=&quot;https://combeyond.bu.edu/offering/misi-summit/&quot;&gt;Boston University Center for Media Innovation &amp;amp; Social Impact (MISI) Summit&lt;/a&gt; recently, where a diverse group of leaders came together to discuss the theme of “Communicating Climate.” Hosted by the Center’s Directory &lt;a href=&quot;https://www.bu.edu/com/profile/eric-gordon/&quot;&gt;Eric Gordon&lt;/a&gt;, the event included sessions on related topics. Here’s a quick blog post about key points from one session that focused on how journalism, academia, and public media can restore trust in an era of economic instability and eroding credibility.&lt;/p&gt;

&lt;p&gt;&lt;img src=&quot;/static/img/posts/MISI-panel.jpg&quot; alt=&quot;photo from behind audience at opening of panel talk, with speakers on stage and slide introducing them on a big projected screen behind&quot; /&gt;
&lt;em&gt;The start of the “Business of News” Panel&lt;/em&gt;&lt;/p&gt;

&lt;h2 id=&quot;the-business-of-news-audience-trust-and-the-future-of-science-and-climate-policy-journalism&quot;&gt;The Business of News: Audience Trust and the Future of Science and Climate Policy Journalism&lt;/h2&gt;

&lt;p&gt;This panel included &lt;a href=&quot;https://www.bu.edu/com/profile/brian-mcgrory/&quot;&gt;Brian McGrory&lt;/a&gt; (BU Journalism Dept Chair, formed Boston Globe editor), &lt;a href=&quot;https://www.suerobinson.org&quot;&gt;Sue Robinson&lt;/a&gt; (Helen Firstbrook Franklin Professor of Journalism at the University of Wisconsin–Madison), and &lt;a href=&quot;https://www.wbur.org/inside/staff/dan-mauzy&quot;&gt;Dan Mauzy&lt;/a&gt; (WBUR’s Executive Editor for News).&lt;/p&gt;

&lt;p&gt;It was moderated by &lt;a href=&quot;https://www.bu.edu/com/profile/meghan-irons/&quot;&gt;Meghan Irons&lt;/a&gt; (investigative journalist and Professor of the Practice at Boston University), who opened with the fundamental question: how do we restore public trust when newsrooms face economic instability, eroding credibility, and an increasingly complex information landscape?&lt;/p&gt;

&lt;h3 id=&quot;local-news-attention-and-transparency&quot;&gt;Local News, Attention, and Transparency&lt;/h3&gt;

&lt;p&gt;Dan Mauzy opened with an optimistic take: &lt;strong&gt;local news has a leg up compared to national outlets, and public media has an advantage among those familiar with it.&lt;/strong&gt; He focused don transparency as a tool for building trust, particularly related to showing how journalism happens and showing your work. Dan noted a sobering statistic: “8 or 10 people have never spoken to a local reporter”—a problem that needs solving. In parallel, to ensure they are representative WBUR conducts audits on their source demographics, comparing them to local census data to ensure representative coverage.&lt;/p&gt;

&lt;p&gt;For climate coverage specifically, the answer is making it local: to “literally bring it home to people.” Instead of abstract global warming statistics, WBUR focuses on local watering holes, jobs, heat pumps, energy bills, and solutions. Dan pointed to their &lt;a href=&quot;https://www.wbur.org/news/2024/02/12/cape-cod-septic-systems-sewers-solutions-cost&quot;&gt;Cape Cod plumbing contamination story&lt;/a&gt; about nitrates as an example of making climate impacts tangible. WBUR now has 3 reporters plus an editor dedicated to their climate team.&lt;/p&gt;

&lt;p&gt;Brian McGrory acknowledged the widespread distrust across institutions, but noted an interesting pattern: &lt;strong&gt;crises drive attention&lt;/strong&gt;. The Boston bombing, Trump’s first presidency, and COVID all led to subscription bumps for the Globe. Digital subscriptions actually doubled during the pandemic. “These are votes of confidence on things we do and how we do it,” Brian said. But he expressed uncertainty about how to reach people who aren’t actively seeking out journalism without alienating current loyal readers. His philosophy: “when readers see things they like, admire, trust, they will stay, otherwise they will leave.” The answer for him? “Focus on interesting, important, consequential stories and the business model will follow.”&lt;/p&gt;

&lt;p&gt;Sue Robinson opened arguing that journalism education and newsrooms aren’t teaching key skillsets: &lt;strong&gt;radical transparency vis-à-vis audience engagement and restoring trust&lt;/strong&gt;. She pointed to organizations modeling this approach: ProPublica, Seattle Times, WITF, Baltimore Banner. &lt;a href=&quot;https://www.witf.org/news/elections/&quot;&gt;WITF’s 2024 election coverage&lt;/a&gt; particularly stood out to Sue as a model, with Q&amp;amp;As, pledges about misinformation and accuracy, and messaging about what’s happening behind the story.&lt;/p&gt;

&lt;p&gt;She pushed back on Brian’s faith in the story, pointing out that &lt;strong&gt;“The audience doesn’t come to you—they’re being told to hate us.”&lt;/strong&gt; Instead of waiting for audiences to find them, news organizations need to actively work on finding common values with communities.&lt;/p&gt;

&lt;h3 id=&quot;whats-actually-working&quot;&gt;What’s Actually Working&lt;/h3&gt;

&lt;p&gt;Sue shared how listening sessions can change hearts and minds of those reaaders who are being told to disengage from local news. Through the &lt;a href=&quot;https://trustingnews.org&quot;&gt;Trusting News project&lt;/a&gt;, she works with journalists who talked to people disengaged from news brands. The sessions were challening for journalists who had to listen with “devastating results that traumatized [them].” But here’s the remarkable part: &lt;strong&gt;after those conversations, over half of participants said it built confidence and one-third said they’d probably subscribe.&lt;/strong&gt; The takeaway? “Bring people together around a shared topic of concern—listening and convening can change hearts and minds.”&lt;/p&gt;

&lt;p&gt;Dan echoed this from WBUR’s experience hosting listening sessions on various topics, particularly in response to parachute journalism in the Merrimack Valley. It’s “enriching for reporters to hear what people think is overlooked, what makes them proud.” The challenge? How to scale these personal interactions.&lt;/p&gt;

&lt;p&gt;Brian agreed that focus groups are great, creating opportunties where meeting the person behind the reporting can “change the tune of the haters”. He pushes to have journalists that get out into community regularly: “in local news we can do that.” Brian also emphasized inviting readers into the process itself—not just the meta aspects like ethics policies, but the exposing the actual work of reporting. When people understand the process, trust follows.&lt;/p&gt;

&lt;p&gt;Sue highlighted that public media is out front on innovative approaches. She pointed to &lt;a href=&quot;https://lookout.co/&quot;&gt;Lookout Santa Cruz&lt;/a&gt; as a leading example. They partner with influencers in the community itself to scale their reach, working with people who have huge groups of followers not already connected to their local media organizations. Lookout is even fundraising with non-profits together. We “need more creativity like this,” Sue urged.&lt;/p&gt;

&lt;p&gt;Dan shared that WBUR is doing more vertical video now and reaching new audiences. They’ve done partnerships with trusted community members on niche topics, which has expanded reach and built trust. The catch? “Platforms are not our friends, so how to build loyal communities developed within those platforms is challenging.”&lt;/p&gt;

&lt;h3 id=&quot;the-hard-questions&quot;&gt;The Hard Questions&lt;/h3&gt;

&lt;p&gt;&lt;strong&gt;Are Some People Unreachable?&lt;/strong&gt; Brian posed a difficult question about major outlets like the NYT, WSJ, and Globe: while subscribers are growing, they’re “not reaching a critical part of the country.” His question: “BUT are those people reachable, and what might be the sacrifice to reach them?”&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;The Talent Pipeline is Broken&lt;/strong&gt; Sue ended on a sobering economic note: pay rates aren’t good enough, so students coming out can’t afford to go into traditional newsrooms. Instead they’re going to NGOs and corporate communications to exercise similar skills. This threatens the entire pipeline of journalism talent.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;The Continually Shifting Media Ecosystem&lt;/strong&gt; Dan raised another challenge for the future: “how to create loyalty in a fractured media space? Discovery is harder in an AI-mediated internet search world.” As platforms and AI change how people find information, news organizations need new strategies for building direct relationships with audiences.&lt;/p&gt;

&lt;p&gt;I left with a sense of both sobering challenges and promising experiments. While there’s no silver bullet for restoring trust, the path forward Sue, Brian, and Dan pointed at focused on doubling down on transparency, community engagement, and creativity. I can get behind that.&lt;/p&gt;

&lt;p&gt;&lt;img src=&quot;/static/img/posts/MISI-views.jpg&quot; alt=&quot;photo out window high above picturesque view down Charles River towards Boston&quot; /&gt;
&lt;em&gt;The views out the new-ish BU Computer and Data Science building are alwasy amazing 🍁☀️💦&lt;/em&gt;&lt;/p&gt;
</description>
        <pubDate>Mon, 10 Nov 2025 15:00:00 +0000</pubDate>
        <link>https://dataculturegroup.org/2025/11/10/MISI-climate-journalism-trust.html</link>
        <guid isPermaLink="true">https://dataculturegroup.org/2025/11/10/MISI-climate-journalism-trust.html</guid>
        
        
        <category>journalism</category>
        
        <category>events</category>
        
      </item>
    
      <item>
        <title>Notes from csv,conf,v9: Community-Centered Data Stories</title>
        <description>&lt;p&gt;The University of Bologna graciously hosted &lt;a href=&quot;https://csvconf.com&quot;&gt;csv,conf,v9&lt;/a&gt; in Palazzo Malvezzi. Accordion music drifted in through the windows while we gathered, echoing the mix of technical innovation and creative application that folks shared. Speakers from around the world shared how they’re using data not just to inform, but to transform communities. Here are gently editing notes from the sessions I attended (edited with some help from AI); any errors are probably my fault.&lt;/p&gt;

&lt;p&gt;&lt;img src=&quot;/static/img/posts/csvconf9-attendees.jpg&quot; alt=&quot;photo of 2nd floor of building showing lots of people standing within arches&quot; /&gt;
&lt;em&gt;Photo of all the attendees at csv,conf,v9 (source: &lt;a href=&quot;https://bsky.app/profile/chodacki.bsky.social/post/3lyi7go6ils2l&quot;&gt;John Chodacki&lt;/a&gt;)&lt;/em&gt;&lt;/p&gt;

&lt;hr /&gt;

&lt;h2 id=&quot;filling-information-voids-with-chatbots-using-llms-on-whatsapp-for-news-access&quot;&gt;Filling Information Voids with Chatbots: Using LLMs on WhatsApp for News Access&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Mathias Felipe (he/him), InfoAmazonia&lt;/strong&gt;&lt;/p&gt;

&lt;blockquote&gt;
  &lt;p&gt;In the Amazon forest, communities often face critical information voids around environmental issues, while relying heavily on WhatsApp as their main communication platform. This talk presents BOTO, an open-source chatbot developed by InfoAmazonia that delivers localized, thematic environmental information—such as deforestation alerts and wildfires—through WhatsApp. Powered by Large Language Models (LLMs) and designed with a low-bandwidth architecture, BOTO aims to make local news information more accessible in under-served areas. In this talk, we’ll discuss how BOTO was created with participatory design to empower communities, bridge information gaps, and put news directly in the hands of those who need it most.&lt;/p&gt;
&lt;/blockquote&gt;

&lt;p&gt;&lt;img src=&quot;/static/img/posts/csvconf9-boto.JPG&quot; alt=&quot;a man in sunglasses standing in front of a projector showing a logo and whatsapp messages&quot; /&gt;
&lt;em&gt;Felipe showing what Boto looks like in action&lt;/em&gt;&lt;/p&gt;

&lt;p&gt;The most striking thing about &lt;a href=&quot;https://infoamazonia.org/en/project/boto/&quot;&gt;BOTO&lt;/a&gt; isn’t its technical architecture, others are also exploring WhatsApp Business API integration with RAG-based article search; it’s how InfoAmazonia approached the design process. Focus groups with different indigenous communities shaped every decision, from interface choices to content summaries. The result? A system where people can select their region and themes, then receive data summaries and links directly to their phones.&lt;/p&gt;

&lt;p&gt;Felipe emphasized something crucial: their audience doesn’t have a problem with chatbots. The hesitation comes from outside assumptions about what “non-technical users” want. When you actually ask communities what they need, the solutions become clearer. BOTO soft-launched recently with official rollout planned for next month, sponsored by &lt;a href=&quot;https://www.icfj.org&quot;&gt;ICFJ&lt;/a&gt;. The real test will be sustained engagement, but the co-design foundation suggests they’re building something people actually want to use.&lt;/p&gt;

&lt;hr /&gt;

&lt;h2 id=&quot;blazing-fast-automagical-metadata&quot;&gt;Blazing Fast Automagical Metadata&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Joel Natividad&lt;/strong&gt;&lt;/p&gt;

&lt;blockquote&gt;
  &lt;p&gt;With the pervasiveness of low quality, low resolution metadata in existing Data Catalogs, and new metadata standards like DCAT3 and Croissant demanding even more FAIR metadata, how do you make it easier for Data Stewards to maintain High Quality Data Catalogs?&lt;/p&gt;
&lt;/blockquote&gt;

&lt;p&gt;&lt;img src=&quot;/static/img/posts/csvconf9-natividad.JPG&quot; alt=&quot;a wide photo of an ornate room with a man sitting up front behind a desk and a projected slide to his right&quot; /&gt;
&lt;em&gt;Natividad giving his talk in a somewhat distractingly ornate room at the palace&lt;/em&gt;&lt;/p&gt;

&lt;p&gt;Natividad’s (from &lt;a href=&quot;https://dathere.com&quot;&gt;datHere&lt;/a&gt;) talk hit on every data steward’s nightmare: the metadata maintenance burden. New standards like DCAT3 and Croissant (for machine learning) demand even richer metadata: properties, data dictionaries, summary stats, frequency tables. The solution? Flip the workflow. Instead of forcing metadata entry before data upload, let people upload first and generate suggested metadata automatically. Their &lt;code class=&quot;language-plaintext highlighter-rouge&quot;&gt;qsv&lt;/code&gt; and &lt;code class=&quot;language-plaintext highlighter-rouge&quot;&gt;describegpt&lt;/code&gt; tools infer data dictionaries, descriptions, and 40+ summary stats. It creates a “data steward in-the-loop” approach—human oversight with automated assistance.&lt;/p&gt;

&lt;p&gt;Most intriguingly, they’re moving toward “chat with your catalog”—custom chatbots that can surface relevant CKAN datasets when answering questions. It’s RAG applied to data discovery, potentially transforming how people find and use  datasets that are published and publicly available.&lt;/p&gt;

&lt;hr /&gt;

&lt;h2 id=&quot;building-csv-powered-tools-for-social-sciences&quot;&gt;Building CSV-powered tools for social sciences&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Guillaume Plique (he/him), médialabSciencesPo, Paris&lt;/strong&gt;&lt;/p&gt;

&lt;blockquote&gt;
  &lt;p&gt;CSV is ubiquitous in social sciences and in the humanities. CSV data is indeed the perfect bridge between social scientists, accustomed to dealing with tabular data, and research engineers needing to process the same data. That is why SciencesPo’s médialab has been building many of its Open-Source tools around CSV files, from well-designed web apps such as Table2Net to convert tabular data into graph data, down to powerful CLI tools such as minet to collect data from the web or xan to process tabular data using constrained resources.&lt;/p&gt;
&lt;/blockquote&gt;

&lt;p&gt;&lt;img src=&quot;/static/img/posts/csvconf9-plique.JPG&quot; alt=&quot;a man with dark hair and a beard sitting at a computer with a large slide about CSV content behind him&quot; /&gt;
&lt;em&gt;Plique, and the lab where he works, are _all_ about CSV&lt;/em&gt;&lt;/p&gt;

&lt;p&gt;Plique delivered &lt;a href=&quot;https://github.com/medialab/xan/blob/master/docs/LOVE_LETTER.md&quot;&gt;a love letter to CSV&lt;/a&gt;. His argument is compelling: CSV is affordable, understandable, and free. It functions as a bridge between researchers, students, and engineers. It doesn’t require complex compute resources and naturally encourages data sobriety. The &lt;a href=&quot;https://medialab.sciencespo.fr&quot;&gt;médialab’s&lt;/a&gt; tool ecosystem is impressive—&lt;code class=&quot;language-plaintext highlighter-rouge&quot;&gt;table2net&lt;/code&gt; for web-based CSV-to-graph conversion, &lt;code class=&quot;language-plaintext highlighter-rouge&quot;&gt;takoyaki&lt;/code&gt; for clustering CSV data to find clerical errors, &lt;code class=&quot;language-plaintext highlighter-rouge&quot;&gt;minet&lt;/code&gt; for CLI-based CSV processing, and &lt;code class=&quot;language-plaintext highlighter-rouge&quot;&gt;xan&lt;/code&gt; for visualization. All built on the principle that tabular data is inherently more comprehensible than JSON hierarchies. His closing advice: “If you need random access, use SQLite. Otherwise, CSV.” It’s a reminder that sometimes the simplest tools are the most durable—you’ll still be able to open a CSV file in 50 years.&lt;/p&gt;

&lt;hr /&gt;

&lt;h2 id=&quot;keynote-developing-tech-with-community-the-example-of-open-data-editor&quot;&gt;Keynote: Developing tech with community: the example of Open Data Editor&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Sara Petti, Open Knowledge Foundation&lt;/strong&gt;&lt;/p&gt;

&lt;blockquote&gt;
  &lt;p&gt;Open Data Editor is a desktop application specifically designed to help people detect errors in tables. It has been developed in constant interaction with the community from a very early stage. These interactions helped us understand what was really helping the community and what not, and especially made us aware of how much the use of such a tool could actually be helpful in increasing data literacy.&lt;/p&gt;
&lt;/blockquote&gt;

&lt;p&gt;&lt;img src=&quot;/static/img/posts/csvconf9-petti.JPG&quot; alt=&quot;a woman sits behind a desk with a computer and microphone, with two screenshots on a slide behind her&quot; /&gt;
&lt;em&gt;Petti dove into how they built a data cleanup and analysis tool with partners&lt;/em&gt;&lt;/p&gt;

&lt;p&gt;Petti’s keynote was a masterclass in community-centered design. &lt;a href=&quot;https://okfn.org/en/&quot;&gt;OKFN&lt;/a&gt; initially tried to build “an app to do it all”—review data, prepare it, map it, write articles. The scope was overwhelming and misaligned with user needs. The breakthrough came through extended pilot testing with organizations like ACIJ in Argentina and StoryData in Spain. This led to dramatic simplification and focus. The second round of pilots with five different organizational types revealed something crucial: researchers worried about changing metadata, navigation and design were hugely impactful, and people don’t read documentation.&lt;/p&gt;

&lt;p&gt;Key lessons that resonated: language matters enormously (“validation,” “execute” carry technical baggage), there’s only so much you can do to make highly technical tasks accessible, and many challenges are fundamentally non-technical. Most importantly—iterate based on feedback, keep things simple, and don’t be confused by the latest tech. The project pivoted from React to a more sustainable framework, emphasizing the importance of maintainable code over popular technologies.&lt;/p&gt;

&lt;hr /&gt;

&lt;h2 id=&quot;democratizing-data-data-literacy-for-community-action&quot;&gt;Democratizing data: data literacy for community action&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Emily Zoe Mann (she/her/hers), University of South Florida Libraries&lt;/strong&gt;&lt;/p&gt;

&lt;blockquote&gt;
  &lt;p&gt;This presentation will share the results of Data Literacy for Community Action, a year-long pilot in the community of St. Petersburg, Florida, USA that provided data literacy to non-profit groups and interested members of the community, with a focus on using local data sets and data related to social determinants of health.&lt;/p&gt;
&lt;/blockquote&gt;

&lt;p&gt;&lt;img src=&quot;/static/img/posts/csvconf9-mann.JPG&quot; alt=&quot;photo of a projected slide showing photos from community workshps&quot; /&gt;
&lt;em&gt;Mann shared lessons from hands-on data workshops with library patrons&lt;/em&gt;&lt;/p&gt;

&lt;p&gt;Mann’s project exemplifies academic-community collaboration done right. The partnership between University of South Florida, the public library system, and a philanthropic health equity organization created something none could achieve alone. The workshop structure was brilliantly simple: introduce a concept, share a local dataset demonstrating it, bring in a local person to discuss real-world application, then run group activities. Her &lt;a href=&quot;https://www.usf.edu/arts/news-archive/2023-24-news/20240220-crescendo-interdisciplinary-team-of-faculty-and-students-turn-environmental-impact-data-into-music.aspx&quot;&gt;red tide sonification&lt;/a&gt; example from South Florida shows how hyperlocal data becomes immediately relevant when paired with familiar environmental experiences.&lt;/p&gt;

&lt;p&gt;One insight that stood out: attendees came with vastly different question types, from basic to nuanced. Mann’s library background prepared her for meeting people at all skill levels, but it reinforced how “data literacy” isn’t a single skill—it’s a spectrum of capabilities that communities develop over time. The flexibility lesson is crucial for similar pilots: stay responsive to what people actually need rather than predetermined curricula.&lt;/p&gt;

&lt;hr /&gt;

&lt;h2 id=&quot;keynote-clara-jiménez--david-fernández-sancho&quot;&gt;Keynote: Clara Jiménez &amp;amp; David Fernández Sancho&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Maldita.es Foundation&lt;/strong&gt;&lt;/p&gt;

&lt;blockquote&gt;
  &lt;p&gt;Clara Jiménez Cruz is the co-founder and CEO of the Maldita.es Foundation, a leading organization in the fight against misinformation… David Fernández is the current CTO of the Maldita.es Foundation, where he leads the development of innovative digital tools to combat disinformation.&lt;/p&gt;
&lt;/blockquote&gt;

&lt;p&gt;&lt;img src=&quot;/static/img/posts/csvconf9-maldita1.JPG&quot; alt=&quot;photo of a projected slide showing photos from community workshps&quot; /&gt;
&lt;em&gt;The team shared a timeline of work they’ve done&lt;/em&gt;&lt;/p&gt;

&lt;p&gt;This keynote hit hardest, perhaps because of recent events. Jiménez and Fernández positioned disinformation not as isolated false information, but as structured narratives—emotionally engaging, rationally grounded, amplified by trusted figures, global in scope. &lt;a href=&quot;https://maldita.es/clima/20241104/dams-reservoirs-removed-floods-valencia/&quot;&gt;Their Valencia floods case study&lt;/a&gt; was chilling: the same narrative emerging simultaneously from Russian sources, far-right groups, and anti-vaxxers, all attacking institutional trust. The message was consistent: “You can’t trust government, trust me.”&lt;/p&gt;

&lt;p&gt;Their AI-assisted approach uses Wikidata entity extraction to cluster claims, with human-in-the-loop narrative association. They’re tracking 600+ claims monthly—a scale that demands automation while recognizing AI’s limitations in understanding social context.&lt;/p&gt;

&lt;p&gt;The most powerful moment was Jiménez’s declaration: “data as a means of resistance.” In an environment where “the internet is becoming more dangerous on purpose,” systematic documentation and analysis of disinformation patterns becomes a form of civic defense.&lt;/p&gt;

&lt;hr /&gt;

&lt;h2 id=&quot;data-without-borders-straight-out-of-spreadsheets-and-into-the-streets&quot;&gt;Data without borders: Straight out of spreadsheets and into the streets&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Michael Brenner (He/Him/His), Data4Change&lt;/strong&gt;&lt;/p&gt;

&lt;blockquote&gt;
  &lt;p&gt;At Data4Change, we work with communities around the world to explore how data can be collected, understood and shared in radically different ways. This talk takes you through a series of surprising, community-led projects that didn’t begin with data, but with burning questions our community partners wanted to find answers to.&lt;/p&gt;
&lt;/blockquote&gt;

&lt;p&gt;Brenner’s examples were extraordinary—from &lt;a href=&quot;https://www.informationisbeautifulawards.com/showcase/7446-equal-victims-the-universal-reality-of-intimate-partner-violence&quot;&gt;addressing intimate partner violence in Uganda’s LGBTQ community&lt;/a&gt;, to &lt;a href=&quot;https://www.data4chan.ge/our-work/hear-the-blind-spot&quot;&gt;flute music rendering Ethiopian census data&lt;/a&gt; audible to the blind. Each project started not with data, but with burning community questions. I mention many of them in my Community Data book.&lt;/p&gt;

&lt;p&gt;The &lt;a href=&quot;https://www.data4chan.ge/our-work/life-under-curfew&quot;&gt;data murals project with Haki Data Lab in Mathare&lt;/a&gt; particularly resonated. Rather than imposing external visualization methods, they recognized murals as a standard local communication mode, then created input visualization murals with string and stencils. The “power portraits” methodology for the Equal Victims project was brilliant—co-created visual representations that function simultaneously as art and data points about lived experience, avoiding retraumatization while enabling systematic documentation.&lt;/p&gt;

&lt;p&gt;Brenner’s language shift is important: moving from “data literate” toward “data curious, data critical, and data confident.” It reframes the relationship from deficit-based (lacking literacy) to asset-based (developing capabilities).&lt;/p&gt;

&lt;hr /&gt;

&lt;h2 id=&quot;keynote-fifteen-years-into-the-open-data-movement&quot;&gt;Keynote: Fifteen Years into the Open Data Movement&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Giorgia Lodi &amp;amp; Andrea Borruso&lt;/strong&gt;&lt;/p&gt;

&lt;blockquote&gt;
  &lt;p&gt;Fifteen years into the open data movement, we take the stage as an unusual duo to reflect, provoke, and laugh about where we’ve landed. A dialogue, shaped by friendship, a few successes, many failures, and a bit of fun—think less keynote, more late-night talk show.&lt;/p&gt;
&lt;/blockquote&gt;

&lt;p&gt;This closing keynote was part celebration, part wake-up call. Lodi and Borruso’s Italian open data examples were both hilarious and heartbreaking—ministry electric charging station data published with beautiful graphs but completely locked down, forcing community scraping to make it actually useful. Their analysis cuts deep: following the letter of the law isn’t enough when datasets lack description, remain readable only by specialists, or simply don’t exist (particularly around gender-based violence). Communities get pushed out by technicalities, formalism, and missing data.&lt;/p&gt;

&lt;p&gt;A case study about electric car data exemplifies the problem—officially “open” data published as unusable pivot tables instead of raw, machine-readable records. When the ministry finally released proper data, it came with semicolon separators and complete messiness. But their vector geography data example shows what’s possible: despite delivery delays and nested zip file complexity, raw land ownership data triggered new software, plugins, and services precisely because it was genuinely machine-readable.&lt;/p&gt;

&lt;p&gt;Their conclusion resonated throughout the conference: exploit technology to support data management, think about machine-to-machine AND machine-to-human AND machine-to-community interfaces, and recognize that technical solutions alone can’t address fundamentally social challenges.&lt;/p&gt;

&lt;hr /&gt;

&lt;h2 id=&quot;reflections-data-as-community-practice&quot;&gt;Reflections: Data as Community Practice&lt;/h2&gt;

&lt;p&gt;Walking through Bologna after two days of talks, I kept thinking about the accordion player who inadvertantly soundtracked the opening day. There’s something fitting about that mix of structure and improvisation, tradition and innovation. These presentations collectively argue for a fundamental shift in how we approach data work in and with community. We must shift from extraction to engagement, from literacy deficits to community assets, from technical solutions to socio-technical systems. Whether it’s BOTO’s co-designed news delivery, Data4Change’s participatory data projects, or Open Data Editor’s community-driven development, the most compelling projects start with community questions rather than data capabilities.&lt;/p&gt;

&lt;p&gt;&lt;img src=&quot;/static/img/posts/csvconf9-me.jpg&quot; alt=&quot;photo of me speaking with a slide about my book behind&quot; /&gt;
&lt;em&gt;I gave a keynote related to my _Community Data_ book (source: &lt;a href=&quot;https://bsky.app/profile/mcx83.bsky.social/post/3lykdmlb7mc23&quot;&gt;Marco Cortella&lt;/a&gt;)&lt;/em&gt;&lt;/p&gt;

&lt;p&gt;The CSV theme threading through many talks isn’t just about file formats—it’s about accessibility, sustainability, and democratic participation in data practices. In a world increasingly dominated by complex AI systems and proprietary platforms, there’s something radical about insisting that the most important data work can happen with simple, open, maintainable tools. As disinformation accelerates and institutional trust erodes, these community-centered data practices feel more urgent than ever. The question isn’t just how we make data more accessible, but how we make data work serve community needs rather than institutional convenience.&lt;/p&gt;
</description>
        <pubDate>Mon, 29 Sep 2025 15:00:00 +0000</pubDate>
        <link>https://dataculturegroup.org/2025/09/29/csvconf9.html</link>
        <guid isPermaLink="true">https://dataculturegroup.org/2025/09/29/csvconf9.html</guid>
        
        
        <category>data-literacy</category>
        
        <category>journalism</category>
        
        <category>civic-engagement</category>
        
        <category>events</category>
        
        <category>design</category>
        
      </item>
    
      <item>
        <title>Protest Mapper: a new tool for journalists covering protests</title>
        <description>&lt;p&gt;As protests against the Trump administration spread, journalists are covering local rallies — but putting them in context of larger movements can be a challenge. To help, I built a simple tool that lets you search for recent protests near you and create an embeddable map you can use in your reporting. &lt;a href=&quot;https://dataculture.northeastern.edu/local-protest-map/&quot;&gt;Try out Protest Mapper&lt;/a&gt;.&lt;/p&gt;

&lt;iframe title=&quot;Protest Mapper&quot; aria-label=&quot;Map of 171 local protests&quot; id=&quot;local-protest-mapper-embed&quot; src=&quot;https://dataculture.northeastern.edu/local-protest-map/?v=1&amp;amp;s=ACLED&amp;amp;c=-71.11999511718751%2C42.40115038362433&amp;amp;z=10&amp;amp;r=20&amp;amp;sd=2025-01-01&amp;amp;ed=2025-05-02&amp;amp;w=700&amp;amp;h=350&amp;amp;i=pin&amp;amp;t=1&amp;amp;m=alidade-smooth&amp;amp;a=&quot; width=&quot;700&quot; height=&quot;430&quot; frameborder=&quot;0&quot; scrolling=&quot;no&quot; data-external=&quot;1&quot; style=&quot;border: none;&quot;&gt;&lt;/iframe&gt;
&lt;p&gt;&lt;em&gt;A map of the more than 400 protests in eastern Massachusetts since the start of the year.&lt;/em&gt;&lt;/p&gt;

&lt;h2 id=&quot;media-attention-to-protests&quot;&gt;Media Attention to Protests&lt;/h2&gt;

&lt;p&gt;One of the responses to the nationwide April 5 “Hands Off” and May 1 Mayday rallies was shock at the perceived lack of coverage from mass media outlets. Critics pointed to &lt;a href=&quot;https://www.bostonglobe.com/2025/04/07/opinion/letters-to-the-editor-hands-off-protest-coverage/&quot;&gt;a lack&lt;/a&gt; of both &lt;a href=&quot;https://newrepublic.com/article/193683/print-media-downplay-mass-protests&quot;&gt;print&lt;/a&gt; and &lt;a href=&quot;https://thefederalist.com/2022/07/15/heres-why-the-media-dont-want-you-to-know-about-the-massive-protests-going-on-around-the-globe/&quot;&gt;digital coverage&lt;/a&gt;, arguing that it didn’t reflect the sheer volume of people that came out on the streets.&lt;/p&gt;

&lt;p&gt;&lt;img src=&quot;/static/img/posts/protest-mapper-headlines.png&quot; alt=&quot;screenshots of various headlines critiquing media coverage of the April 5 “Hands Off” protests&quot; /&gt;
&lt;em&gt;A sample of headlines critiquing media coverage of the April 5 “Hands Off” protests.&lt;/em&gt;&lt;/p&gt;

&lt;p&gt;At the same time, local and regional media did show a noticeable bump in coverage. A query for articles mentioning both “hands off” and “protest” in the &lt;a href=&quot;https://search.mediacloud.org/&quot;&gt;Media Cloud online news archive&lt;/a&gt; shows almost 400 matching stories on the 5th.&lt;/p&gt;

&lt;p&gt;&lt;img src=&quot;/static/img/posts/protest-mapper-chart.png&quot; alt=&quot;a line chart of media attention from Media Cloud, showing a peak on April 5th&quot; /&gt;
&lt;em&gt;A Media Cloud query for “‘hands off’ AND ‘protest’” shows a peak of coverage in state and local papers on April 5.&lt;/em&gt;&lt;/p&gt;

&lt;p&gt;Yet it appears that local reporters are covering protests in their area, but not often connecting them to larger movements. That might be because coalitions like &lt;a href=&quot;https://www.fiftyfifty.one/&quot;&gt;#50501&lt;/a&gt; aren’t as well known as unions and long-standing activist groups; they don’t have communications people with long-standing relationships to journalists.&lt;/p&gt;

&lt;h2 id=&quot;finding--evaluating-a-public-protest-dataset&quot;&gt;Finding &amp;amp; Evaluating a Public Protest Dataset&lt;/h2&gt;

&lt;p&gt;One approach to help reporters make those links for readers, and put individual events in a broader context, is to use data about local protests. Connecting this weekend’s rally to events over the last few weeks might connect dots for audiences that are seeing public displays of resistance. I wondered if I could quickly map protests in my area based on existing data sources.&lt;/p&gt;

&lt;p&gt;I first reviewed existing data sources to see what was available. A quick summary of what I found:&lt;/p&gt;
&lt;ul&gt;
  &lt;li&gt;The &lt;a href=&quot;https://www.gdeltproject.org/&quot;&gt;GDELT&lt;/a&gt; project appears to have massive scope and has a “Protests” event category, but I found it too hard to access and use, with many broken links and inconsistent information.&lt;/li&gt;
  &lt;li&gt;The &lt;a href=&quot;https://carnegieendowment.org/features/global-protest-tracker?lang=en&quot;&gt;Carnegie Protest Tracker&lt;/a&gt; seems to focus on broad movements, not individual events.&lt;/li&gt;
  &lt;li&gt;&lt;a href=&quot;https://usprotests.liveuamap.com/&quot;&gt;LiveUaMap&lt;/a&gt; seems well established, with lots of social media results, but I found the interface challenging to use and they charge for access.&lt;/li&gt;
  &lt;li&gt;&lt;a href=&quot;https://www.amnestyusa.org/protest-map-launch/&quot;&gt;Amnesty International’s&lt;/a&gt; Protest Map focuses on police violence in response.&lt;/li&gt;
  &lt;li&gt;&lt;a href=&quot;https://acleddata.com/&quot;&gt;ACLED&lt;/a&gt; (Armed Conflict Location &amp;amp; Event Data) appears to be a well staffed non-profit that makes data easily available, updated once a week.&lt;/li&gt;
  &lt;li&gt;The Harvard Ash Center &lt;a href=&quot;https://ash.harvard.edu/programs/crowd-counting-consortium/#data&quot;&gt;Crowd Counting Consortium&lt;/a&gt; (CCC) appears to be an academic project that is updated once a month.&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;This led me to focus on ACLED and CCC, since they are both regularly updated and easily accessible/usable. Each includes a list of protests and related manually curated metadata. They include protests about any topic.&lt;/p&gt;

&lt;p&gt;My next goal was to evaluate these datasets. One metric that came to mind was to search for media coverage about protests in a fixed place and time, and then to compare that to the datasets at hand. I decided on Massachusetts in February, a context I know well, and pulled datasets from both Media Cloud and ACLED. To find media stories I queried Media Cloud for relevant anti-Trump/Musk protests with this query:&lt;/p&gt;

&lt;div class=&quot;language-plaintext highlighter-rouge&quot;&gt;&lt;div class=&quot;highlight&quot;&gt;&lt;pre class=&quot;highlight&quot;&gt;&lt;code&gt;article_title:(
(protest* OR rally OR strike OR boycott OR walkout OR demonstrat* OR vigil)  
AND 
(Musk OR Trump OR Tesla OR closure OR federal OR university OR Israel OR Palestine OR cuts OR speech OR student OR research OR congress OR ICE OR agency OR USAID OR NOAA OR NSF OR NIH)
) and language:en
&lt;/code&gt;&lt;/pre&gt;&lt;/div&gt;&lt;/div&gt;

&lt;p&gt;This discovered 76 related stories. I then coded them by hand to indicate if they were about protests in Massachusetts, identifying 41 stories about anti-Trump protests in the state from Media Cloud. In parallel, I pulled February data for the US from ACLED and filtered by the &lt;code class=&quot;language-plaintext highlighter-rouge&quot;&gt;admin1&lt;/code&gt; column to just Massachusetts events, identifying 51 entries in their database. To evaluate the two, we then reviewed every story in the Media Cloud corpus to identify which ACLED event it was about (linking it by the unique &lt;code class=&quot;language-plaintext highlighter-rouge&quot;&gt;event_id_cnty&lt;/code&gt; provided from ACLED). Just two protests were in the Media Cloud dataset that did not have a related ACLED entry, both small small events reported by university student-run news sites. 95% of the sample I pulled from the basic Media Cloud query were already in ACLED, leading me to believe it was a trustworthy source for a list of protests. You can &lt;a href=&quot;https://docs.google.com/spreadsheets/d/1KWI7kUK0ZLZJs2SJl5FulZX_sKJRzhqO0CJPJvbzPL4/edit?usp=sharing&quot;&gt;review the coded evaluation data&lt;/a&gt;.&lt;/p&gt;

&lt;h2 id=&quot;a-mapping-solution&quot;&gt;A Mapping Solution&lt;/h2&gt;

&lt;p&gt;Dataset in hand, I decided to create a simple tool that would allow journalists to embed maps of recent local protests in their stories. The map at the top of this article is one example. Visit https://dataculture.northeastern.edu/local-protest-map/ to try it out yourself. You can filter by location, timeframe, and customize the map a little.&lt;/p&gt;

&lt;p&gt;Tools like these require ongoing support, so I’ve made some decisions to try and minimize maintenance while maximizing utility. Here is a list of design decisions, concerns, and commitments:&lt;/p&gt;

&lt;ul&gt;
  &lt;li&gt;I’ve mimicked the well known &lt;a href=&quot;https://www.datawrapper.de/&quot;&gt;DataWrapper workflow&lt;/a&gt; to try and make it familiar and usable for data journalists. At the end you get an embed code (HTML iFrame) you can copy and paste into your webpage, just like I did above&lt;/li&gt;
  &lt;li&gt;This is deployed as a static HTML/CSS/JS site with no database or server compute power required, written with the open-source &lt;a href=&quot;https://svelte.dev/&quot;&gt;Svelte UI framework&lt;/a&gt;.&lt;/li&gt;
  &lt;li&gt;Some protests are marked in the exact same spot, for instance in the center of a city. This makes them very hard to see on the map. For these events I’ve a small random bump to the location in order to make them more visible. That is noted in the popup for impacted markers.&lt;/li&gt;
  &lt;li&gt;&lt;a href=&quot;https://github.com/dataculturegroup/local-protest-map&quot;&gt;The code is open source on GitHub&lt;/a&gt; for full transparency.&lt;/li&gt;
  &lt;li&gt;I’m hosting the tool on GitHub Pages so I don’t have any ongoing expenses.&lt;/li&gt;
  &lt;li&gt;IFrame embeds can be tricky and limited, so the tool also offers the option to download a static image of the map in .png format.&lt;/li&gt;
  &lt;li&gt;You can pick between the &lt;a href=&quot;https://acleddata.com/&quot;&gt;ACLED&lt;/a&gt; (updated weekly) and &lt;a href=&quot;https://ash.harvard.edu/programs/crowd-counting-consortium/&quot;&gt;CCC&lt;/a&gt; (updated monthly) data. For now I plan to manually pull the updated data each week, but hope to automate that later.&lt;/li&gt;
  &lt;li&gt;I’ve used a light background map base layer from &lt;a href=&quot;https://ash.harvard.edu/programs/crowd-counting-consortium/&quot;&gt;Stadia Maps&lt;/a&gt;. This means it can’t support more than 200,000 hits across all uses. If this somehow magically becomes very popular I can easily change that via the &lt;a href=&quot;https://leafletjs.com/&quot;&gt;Leaflet mapping library&lt;/a&gt;.&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;Give it a try, share it, and let us know what you find in the comments. Hopefully this tool and explanation will help you and others include data about the wave of protests to your reporting and add context to individual events you’re covering.&lt;/p&gt;

&lt;p&gt;&lt;em&gt;&lt;a href=&quot;https://www.storybench.org/try-this-tool-to-map-protests-for-local-news/&quot;&gt;Originally published on Storybench&lt;/a&gt;&lt;/em&gt;&lt;/p&gt;
</description>
        <pubDate>Thu, 08 May 2025 15:00:00 +0000</pubDate>
        <link>https://dataculturegroup.org/2025/05/08/protest-mapper.html</link>
        <guid isPermaLink="true">https://dataculturegroup.org/2025/05/08/protest-mapper.html</guid>
        
        
        <category>ai</category>
        
        <category>journalism</category>
        
        <category>civic-engagement</category>
        
      </item>
    
      <item>
        <title>AI for Good: Five Key Principles for the Pro-Social Sector</title>
        <description>&lt;p&gt;Artificial Intelligence (AI) is reshaping industries across the board, but its impact in the pro-social sector is worth digging into more to understand emerging norms and challenges. Our typical examples stem from years of work on “civic tech”, “public interest tech”, “data science for good”, and related efforts. Most of these focus on challenges like &lt;a href=&quot;https://www.wildme.org/#/wildbook&quot;&gt;species conservation&lt;/a&gt;, &lt;a href=&quot;https://promedmail.org&quot;&gt;disease outbreak surveillance&lt;/a&gt;, &lt;a href=&quot;https://www.globalforestwatch.org/&quot;&gt;deforestation tracking&lt;/a&gt;. Who doesn’t want to save threatened zebras? I describe these as &lt;strong&gt;accepted good&lt;/strong&gt;: problems that most people agree need to be solved.&lt;/p&gt;

&lt;p&gt;But there are problems less people agree are for the common good. What about AI projects that support for gender equity? Or projects working against surveillance capitalism? Not everyone agrees that these are projects that support the &lt;em&gt;common&lt;/em&gt; good (though I do). These could be collectively called &lt;strong&gt;AI for &lt;em&gt;contested&lt;/em&gt; good&lt;/strong&gt;, and that’s the concept I want to dig into. Specifically, what are some key principles and methods to use when designing AI projects that support work on &lt;em&gt;contested good&lt;/em&gt; in the pro-social sector.&lt;/p&gt;

&lt;p&gt;This post introduces that thinking and some related examples. This line of thinking stems from an invited talk I gave recently at the &lt;a href=&quot;https://ai.utsa.edu/ai-spring-school-2024/&quot;&gt;UTSA MATRIX consortium’s AI Spring School&lt;/a&gt;, which offered an opportunity for me to spend a little more time conceptualizing and formalizing thoughts about these issues for the highly technical audience.&lt;/p&gt;

&lt;p&gt;&lt;img src=&quot;/static/img/posts/utsa-data-for-good-selfie.jpg&quot; alt=&quot;selfie from the stage showing my face in the corner and the audience in the room&quot; /&gt;
&lt;em&gt;I took a quick selfie from the stage while the audience was doing a quick chat about projects they would describe as “AI for good”&lt;/em&gt;&lt;/p&gt;

&lt;hr /&gt;

&lt;h3 id=&quot;1-center-underserved-issues&quot;&gt;1. Center Underserved Issues&lt;/h3&gt;

&lt;p&gt;&lt;strong&gt;Instead of focusing on well-studied and understood problems, turn your attention to innovate in neglected areas. This presents an opportunity for significant social impact.&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;This is particularly relevant in the context of feminicide, where mainstream institutions fail to collect and analyze data effectively. As Catherine D’Ignazio reminds us in &lt;a href=&quot;https://mitpress.mit.edu/9780262048873/counting-feminicide/&quot;&gt;“Counting Feminicide”&lt;/a&gt;, &lt;em&gt;“What isn’t counted doesn’t count.”&lt;/em&gt; Over the last few years she’s been working with groups that create “counterdata” on feminicide to fill gaps in official records on gender-related killings of women, girls (cisgender and transgender). The work of these civil society groups to produce datasets about this violence supports advocating for policy, enforcement, and garnering public attention.&lt;/p&gt;

&lt;p&gt;Producing this data is hard, both logistically and in regard to the human toll. Global groups struggle to create datasets of gender-related killings with ill-fitting tech. This creates significant data gaps and emotional trauma. As &lt;a href=&quot;https://mitpressonpubpub.mitpress.mit.edu/pub/cf-chap3/release/2#nwzl6xiaacr&quot;&gt;one partner noted&lt;/a&gt;,&lt;/p&gt;
&lt;quote&gt;It is not easy to read ten cases of feminicide and put them on a table, disaggregate them, have to put a name, age, circumstances and all that detail, without it affecting you emotionally.&lt;/quote&gt;

&lt;p&gt;In response, I’m part of the &lt;a href=&quot;https://counterdatanetwork.org/&quot;&gt;the Counterdata Network&lt;/a&gt; that works with groups to create bespoke machine learning classifiers that score news stories, cluster them by events, and deliver via email/web. Over 30 groups use them daily to monitor human rights and civil rights violations via news reports. We co-designed the system to help, basically replacing google news alerts with a custom-built open-source data pipeline. The network is growing across multiple topics: feminicide, criminalization of pregnancy, business human rights violations, and more.&lt;/p&gt;

&lt;p&gt;This type of work emphasizes the need to look for issues that are systematically ignored and listen to voices that are unheard or silenced. Innovate in neglected areas with higher potential for social impact.&lt;/p&gt;

&lt;h3 id=&quot;2-build-with-not-for&quot;&gt;2. Build With, Not For&lt;/h3&gt;

&lt;p&gt;&lt;strong&gt;Technology should not be imposed on communities but co-created with them to ensure relevance, impact, and empowerment. This participatory approach contrasts with the traditional top-down model of technology deployment.&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;I first heard this phrase “build with, not for” from Catherine Bracy during her time at Code for America. She used it to emphasize the idea of co-creating technologies with government and community, instead of creating top-down solutions. As technologists, even our language is embedded with the opposite approach – consider the ideas of “deploying” vs. integrating” tools.&lt;/p&gt;

&lt;p&gt;One response to challenges around Individualized Education Programs (IEPs) in the U.S. present a good government-related example to learn from. IEPs are plans created by public schools to offer special education services to children in need of more specialized care. They are created collaboratively across the school system, teachers, and parents. These legally mandated plans are in use for around 8 million students across the US. However, many parents struggle to understand the services their child is offered in their complex IEP.&lt;/p&gt;

&lt;p&gt;The &lt;a href=&quot;https://aiep.org/&quot;&gt;AI-EP&lt;/a&gt; team used interviews to identify common concerns parents had about IEPs. Through interviews, the team identified key challenges and built an AI-powered translation engine and chatbot to help parents navigate complex documents in a personalized way. The features include translation, question answering, and more. This creates a new AI-supported method for helping parents understand a complex document that is written in legal language using complex terms.&lt;/p&gt;

&lt;p&gt;Projects like this remind us to focus on participatory design and iterative solution that integrate feedback from the impacted community. Co-creation ensures relevance, impact, and empowerment.&lt;/p&gt;

&lt;h3 id=&quot;3-study-up&quot;&gt;3. Study Up&lt;/h3&gt;

&lt;p&gt;&lt;strong&gt;AI can be a powerful tool for interrogating power structures, flipping traditional methods to focus on the culture of power rather than the culture of the powerless.&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;Anthropologist &lt;a href=&quot;https://archive.org/details/ERIC_ED065375&quot;&gt;Laura Nader’s concept of &lt;em&gt;“studying up”&lt;/em&gt;&lt;/a&gt; is essential here—using AI to investigate those with authority rather than those affected by their decisions.  She argued that the methods of traditional anthropology, which typically studied groups with less power in social structures, should instead be flipped to study mechanism of power like colonization. &lt;a href=&quot;https://web.archive.org/web/20200719100331/https://medium.com/swlh/studying-up-reorienting-the-field-of-algorithmic-fairness-around-issues-of-power-9968bfbacf8b&quot;&gt;Chelsea Barabas’ work on bringing this idea to algorithmic fairness&lt;/a&gt; has been highly influential on my thinking here.&lt;/p&gt;

&lt;p&gt;A striking recent example is the &lt;a href=&quot;https://landlordtechwatch.org/&quot;&gt;Landlord Tech Watch&lt;/a&gt; project. Landlords increasingly rely on automated screening tools that use criminal records, eviction histories, and credit scores to assess tenants—often reinforcing racial and economic discrimination. By simulating tenant reports, this project demonstrated how these technologies embed bias and provided advocacy groups with concrete evidence for policy change (&lt;a href=&quot;https://doi.org/10.1080/10511482.2022.2113815&quot;&gt;So, 2022&lt;/a&gt;). They are studying the tools of those with power in the rental market to understand disparate impacts and work to fight against them.&lt;/p&gt;

&lt;p&gt;This line of work studying up in AI systems identifies that we should prioritize identifying and interrogating power structures so that we can then flip traditional research methods to challenge systemic biases. We can leverage advanced AI tools to investigate those with power.&lt;/p&gt;

&lt;h3 id=&quot;4-look-for-quick-wins&quot;&gt;4. Look for Quick Wins&lt;/h3&gt;

&lt;p&gt;&lt;strong&gt;Does every problem need a complex AI solution? Often, small, targeted interventions offer more sustainable impact than complex, resource-intensive projects.&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;Journalists are often at the cutting edge of leveraging new technologies, and considering the ethical implications for their field… genAI is no exception. Their work serves as a model for building small, targeted, interventions that offer quick wins with lightweight AI integrations.&lt;/p&gt;

&lt;p&gt;One compelling example is &lt;a href=&quot;https://ijnet.org/en/story/how-ai-chatbot-amplifying-stories-women-caught-paraguays-drug-trade&quot;&gt;the &lt;em&gt;Eva&lt;/em&gt; chatbot&lt;/a&gt; developed by the Paraguayan news outlet &lt;a href=&quot;https://elsurtidor.com/&quot;&gt;El Surtidor&lt;/a&gt;. Paraguay has many women in prison, or awaiting sentencing, for drug smuggling. The team wanted to open a door into the taboo issue, so they trained an AI chatbot based on interviews with women awaiting sentencing for international trafficking. Readers could interact with the bot to ask quesitons about their cases, how they got caught, and more (with appropriate guardrails). It quickly reached more than 15,000 interactions with the public. &lt;em&gt;Eva&lt;/em&gt; helps reframe the public narrative around drug-related crimes by challenging stereotypes and stigma.&lt;/p&gt;

&lt;p&gt;Quick-turn projects with tightly defined use-cases like this help us consider what kind of new stories with social impact we can create, focusing on lightweight, adaptable solutions that offer immediate impact. How can AI help you make impact on a small but well defined idea?&lt;/p&gt;

&lt;h3 id=&quot;5-consider-hidden-risks&quot;&gt;5. Consider Hidden Risks&lt;/h3&gt;

&lt;p&gt;&lt;strong&gt;GenAI technologies are built to look magical, and often produce good-looking results at first pass. Scholars are adopting them to support media research, without fully considering the very real &lt;a href=&quot;https://news.cornell.edu/stories/2023/04/study-uncovers-social-cost-using-ai-conversations&quot;&gt;social&lt;/a&gt;, &lt;a href=&quot;https://www.nytimes.com/2024/09/27/technology/openai-chatgpt-investors-funding.html&quot;&gt;financial&lt;/a&gt;, and &lt;a href=&quot;https://www.washingtonpost.com/technology/2024/09/18/energy-ai-use-electricity-water-data-centers/&quot;&gt;ecological impacts&lt;/a&gt;.&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;More and more often I’m seeing genAI used for computational tasks that are well solved with regular ML: entity extraction, geographic ID, etc. There are a set of already defined problems that we don’t need genAI for. Ongoing work by academics and journalists are uncovering the &lt;a href=&quot;https://dl.acm.org/doi/abs/10.1145/3442188.3445922&quot;&gt;true costs of the massive investment in LLMs&lt;/a&gt;. Like &lt;a href=&quot;https://www.texastribune.org/2024/07/10/texas-bitcoin-mine-noise-power-grid-cryptocurrency/&quot;&gt;Bitcoin&lt;/a&gt;, with genAI there’s a very real future where we have blackouts because of computational power usage.&lt;/p&gt;

&lt;p&gt;An example is a recent study I worked on with a team from &lt;a href=&quot;https://thescopeboston.org/&quot;&gt;The Scope&lt;/a&gt;.  We wanted to analyze articles in the hyper-local news source to assess content diversity via geography, topic, and quotes. We could have used genAI for all of this, as others have, but for &lt;a href=&quot;https://hub.docker.com/r/rahulbot/cliff-clavin&quot;&gt;geography&lt;/a&gt; and &lt;a href=&quot;https://hub.docker.com/r/rahulbot/news-entity-server&quot;&gt;entities&lt;/a&gt; we used existing off-the-shelf solutions because they are well validated, produce reproduceable results, are free to deploy and use, and are open source (I’ve helped build both). However, without well-performing and easy to use existing solutions for quotes, we decided to try the genAI black box. We iterated on prompts against a manually coded list of quotes and ended up using ChatGPT v3.5, which was easy to integrate with, and cost little ($6 total). Quote extraction and attribution (via pronouns) worked well in the end, allowing us to do gender-based analysis. That was only possible at scale thanks to ChatGPT. In the end we were able to do a cross-sectional analysis along those three dimensions of content to help the editors audit their own content, and plan for any changes with that data in mind.&lt;/p&gt;

&lt;p&gt;Our work on that study shows how genAI integrations should be carefully designed to only be used where necessary. That can help reduce areas where results are not easily reproducible and justify work towards algorithmic interpretability to mitigate unintended consequences. Companies creating LLMs are deliberately obfuscating the hidden power and &lt;a href=&quot;(https://generative-ai-newsroom.com/the-often-overlooked-water-footprint-of-ai-models-46991e3094b6)&quot;&gt;environmental costs&lt;/a&gt;, and &lt;a href=&quot;https://www.niemanlab.org/2021/09/well-this-puts-a-nail-in-the-news-video-on-facebook-coffin/&quot;&gt;don’t have a history&lt;/a&gt; of being good partners to the media industry.&lt;/p&gt;

&lt;hr /&gt;

&lt;p&gt;The five principles outlined here—centering underserved issues, building with communities, studying up, looking for quick wins, and considering hidden risks—are foundational for using AI responsibly in the pro-social sector. If you’re interested in AI’s role in social impact work, I encourage you to explore these principles in your own projects. These kind of approaches and examples can help ensure that we model how AI technologies might serve as a tool for empowerment, rather than perpetuating existing inequalities.&lt;/p&gt;

</description>
        <pubDate>Wed, 26 Feb 2025 02:00:00 +0000</pubDate>
        <link>https://dataculturegroup.org/2025/02/26/UTSA-AI-for-good.html</link>
        <guid isPermaLink="true">https://dataculturegroup.org/2025/02/26/UTSA-AI-for-good.html</guid>
        
        
        <category>ai</category>
        
        <category>journalism</category>
        
        <category>civic-engagement</category>
        
      </item>
    
      <item>
        <title>Nebraska Data Art Jam: Beyond Visualization</title>
        <description>&lt;p&gt;I’m just back from delivering the keynote talk at the &lt;a href=&quot;https://arts.unl.edu/carson-center/data-storytelling/&quot;&gt;Data Storytelling Workshop and Data Art Jam&lt;/a&gt;, which brought together a fascinating mix of creative thinkers, from scientists to journalists to artists. Hosted by &lt;a href=&quot;https://arts.unl.edu/carson-center/person/dan-novy/&quot;&gt;Dan Novy&lt;/a&gt; at the &lt;a href=&quot;https://arts.unl.edu/carson-center/&quot;&gt;Johnny Carson Center for Emerging Media Arts&lt;/a&gt;, this interdisciplinary event highlighted new ways of communicating through data, bridging art, technology, and storytelling. Here’s a look at the insights and projects from the other speakers at the event:&lt;/p&gt;

&lt;p&gt;&lt;img src=&quot;/static/img/posts/data-jam-novysan.JPG&quot; alt=&quot;photo of Prof. Novy on stage with a slide behind, holding a giant carved hammer that is labeled T on it&quot; /&gt;
&lt;em&gt;Novysan introducing the event by showing how the R Studio hammer makes everything look like a nail, leading to cookie cutter data vis results&lt;/em&gt;&lt;/p&gt;

&lt;hr /&gt;

&lt;h2 id=&quot;dr-laura-guertins-data-quilts&quot;&gt;Dr. Laura Guertin’s Data Quilts&lt;/h2&gt;

&lt;p&gt;&lt;a href=&quot;https://www.brandywine.psu.edu/person/laura-guertin&quot;&gt;Dr. Laura Guertin from PennState Brandywine&lt;/a&gt; opened the event with her deeply personal and innovative approach to data storytelling. As a trained marine geologist, she had spent much of her career in STEM education. But it was an object from her grandmother—a Kenmore sewing machine—that set her on a new path. Laura began quilting and soon realized that quilts could tell stories, including those based on data. In 2018, she joined a team working on “&lt;a href=&quot;https://www.youtube.com/watch?v=kvR5mx8Qg3k&quot;&gt;coastal optimism&lt;/a&gt;,” an initiative that aimed to counter the often bleak narratives of environmental decline (for me this evoked a &lt;a href=&quot;https://www.solutionsjournalism.org&quot;&gt;solutions journalism&lt;/a&gt; approach). Instead of focusing solely on doom and gloom, she highlighted solutions: open levees in Louisiana that balance storm protection with environmental exchange, and offshore Christmas tree pens used to reduce coastal erosion.&lt;/p&gt;

&lt;p&gt;&lt;img src=&quot;/static/img/posts/data-jam-geurtin.JPG&quot; alt=&quot;photo of Dr. Guertin speaking on stage with slide behind showing a quilt&quot; /&gt;
&lt;em&gt;Dr. Guertin speaking about her data quilts&lt;/em&gt;&lt;/p&gt;

&lt;p&gt;Inspired by these stories, Laura returned home and created “&lt;a href=&quot;https://journeysofdrg.org/2021/06/30/full-collection-stitching-hope/&quot;&gt;Stitching Hope for the Louisiana Coast&lt;/a&gt;,” her first data quilt. The response was overwhelming, and she began exploring more ways to translate climate data into fabric arts. This relates to now-popular trends like temperature scarves—knitted or stitched rows of fabric, each representing a day’s temperature—and expanded the concept to include themes like air quality and rainfall.&lt;/p&gt;

&lt;p&gt;Her most recent work connected to a research expedition on the open ocean. Serving the role of education and outreach on the boat, Zoom interactions with kindergarteners about measuring cloud cover inspired a quilt about the related data they were producing on-board. She shared daily photos of the sky with the kids, and the kids collected their own data to compare. The quilt she later made when back on shore, using local South African fabrics, visually represented cloud cover with colorful trapezoids, connecting scientific observation to physical materials and local culture. Laura’s quilting projects demonstrate how creative approaches to data can evoke curiosity and make complex information more accessible to a wide variety of audiences. As she noted, quilts have a unique power to draw people in—they feel warm and inviting, not intimidating. &lt;a href=&quot;https://journeysofdrg.org/tag/sciquilt/&quot;&gt;Explore more of her work on Dr. Guertin’s website&lt;/a&gt;.&lt;/p&gt;

&lt;h2 id=&quot;matt-waite-on-data-journalism-with-ai&quot;&gt;Matt Waite on Data Journalism with AI&lt;/h2&gt;

&lt;p&gt;Later, &lt;a href=&quot;https://journalism.unl.edu/person/matt-waite/&quot;&gt;Matt Waite&lt;/a&gt;, a journalism professor at the University of Nebraska-Lincoln, took the stage with a compelling argument for using artificial intelligence in journalism—but only for the boring stuff. He described AI as “the dumbest intern you could have,” perfect for automating repetitive tasks while leaving the creative work to humans. During a recent sabbatical, &lt;a href=&quot;https://mattwaite.github.io/posts/a-simple-example-ai-agents-doing-journalism/&quot;&gt;Matt explored how generative AI could support data journalism&lt;/a&gt;. His project focused on using R to integrate with Google Gemini, automating the summarization of county-level data to create personalized, map-based visualizations. He emphasized how this could make journalism more efficient, particularly for local stories.&lt;/p&gt;

&lt;p&gt;&lt;img src=&quot;/static/img/posts/data-jam-waite.JPG&quot; alt=&quot;photo of Prof. Waite speaking on stage with slide behind showing a some data journalism AI experiments&quot; /&gt;
&lt;em&gt;Prof. Waite showing a demo of his work in R Studio on data journalism augmentation with AI&lt;/em&gt;&lt;/p&gt;

&lt;p&gt;However, as he demonstrated, genAI isn’t foolproof. When he added a fact-checking bot to audit the summaries it produced, it caught errors, such as incorrectly describing a zero value as “no change.” Asking it to add “local context” to headlines introduced even more opportunities for inaccuracies. For Matt, these missteps underscored the importance of starting small with AI. Rather than aiming for big, transformative projects, he advocated for using AI to handle the mundane tasks that free up human journalists to focus on what truly matters: creativity and critical thinking.&lt;/p&gt;

&lt;h2 id=&quot;robert-twomey-on-data-and-art&quot;&gt;Robert Twomey on Data and Art&lt;/h2&gt;

&lt;p&gt;The final presentation of the morning came from &lt;a href=&quot;https://roberttwomey.com/about/&quot;&gt;Robert Twomey&lt;/a&gt;, an artist and engineer whose work explores the intersection of data and art. Robert shared how his background in both science and painting informs his practice, with a focus on observation and representation. He drew inspiration from his MFA advisor, Natalie Jeremijenko, and projects like the “&lt;a href=&quot;https://www.bureauit.org/sbox/&quot;&gt;Suicide Box&lt;/a&gt;,” which used motion detection to document unquantified data on the Golden Gate Bridge. Robert pointed to how Jeremijenko’s work asks thought-provoking questions that are still core today: What counts as data? How can art challenge the ways we encounter and understand it?&lt;/p&gt;

&lt;p&gt;&lt;img src=&quot;/static/img/posts/data-jam-twomey.JPG&quot; alt=&quot;photo of Prof. Twomey on stage with slide behind showing data and painting&quot; /&gt;
&lt;em&gt;Prof. Twomey connecting the dots between traditional data representation and painting as observation&lt;/em&gt;&lt;/p&gt;

&lt;p&gt;He showcased projects that ranged from data physicalizations to generative art. One example was &lt;a href=&quot;https://raaf.org/Electronic_Works/Grower/Grower_frames.html&quot;&gt;Sabrina Raaf’s &lt;em&gt;Translator II: Grower&lt;/em&gt;&lt;/a&gt;, which visualized CO₂ levels by “painting” them on a gallery wall. Another, &lt;a href=&quot;https://sensorium.ampd.yorku.ca/aeolian-traces/&quot;&gt;Joel Ong’s &lt;em&gt;Aeolian Traces&lt;/em&gt;&lt;/a&gt;, used human migration data to generate fan-driven wind in a physical installation. Robert’s own work included &lt;a href=&quot;https://roberttwomey.com/megahal-grandmommy/&quot;&gt;a chatbot he created in 2005&lt;/a&gt; based on interactions with his grandmother, who had been diagnosed with Alzheimer’s. The chatbot became an interactive data portrait, offering a poignant exploration of memory and technology. In another project, &lt;em&gt;&lt;a href=&quot;https://roberttwomey.com/garbage-cubes/&quot;&gt;Garbage Cubes&lt;/a&gt;&lt;/em&gt;, Robert compressed physical artifacts into sculptural representations of “irreversible data compression,” challenging traditional notions of how we store and interpret information.&lt;/p&gt;

&lt;p&gt;Throughout his talk, Robert emphasized the role of artists as experts in making things experiential. He pondered genAI’s ability to have explanatory power at all.&lt;/p&gt;

&lt;h2 id=&quot;my-own-take-on-data-beyond-the-visual&quot;&gt;My Own Take on “Data Beyond the Visual”&lt;/h2&gt;

&lt;p&gt;As the opener before these brilliant folks, I took the opportunity to challenge the audience to rethink how we engage with data in community settings. I laid some groundwork by focusing on how now-traditional data visualization methods, developed in fields like science and business, often fail to meet the needs of pro-social domains like museums, libraries, and grassroots activism. These settings require tools that prioritize participation, empowerment, and justice.&lt;/p&gt;

&lt;p&gt;Pulling from &lt;a href=&quot;https://communitydatabook.com&quot;&gt;my book&lt;/a&gt;, I highlighted examples like participatory data murals and civic data sculptures, to demonstrate how creative approaches can reflect data back to communities, fostering deeper connections and driving action. I also previewed work on &lt;em&gt;Data Drums&lt;/em&gt;, a climate change sonification composed for a Brazilian drum troop, which is set to debut this spring (more to come on here soon).&lt;/p&gt;

&lt;p&gt;Through evocative examples and practical insights, I urged the audience to go beyond default charts and graphs, activating the latent creative capacity within communities to make data work more inclusive, impactful, and joyful.&lt;/p&gt;

&lt;hr /&gt;

&lt;p&gt;Those talks set the scene for a hands-on 2nd half of the day, where groups spent hours designing and building data sculptures with copious amounts of craft materials, large and small, that Novy had supplied. The open space made collaboration easy, and groups dove in with impressive energy to the new activity.&lt;/p&gt;

&lt;p&gt;&lt;img src=&quot;/static/img/posts/data-jam-collage.JPG&quot; alt=&quot;collage of photos showing people in an open work area constructing things out of craft materials of varying sizes&quot; /&gt;
&lt;em&gt;Collage of teams working on their data sculptures.&lt;/em&gt;&lt;/p&gt;

&lt;p&gt;Overall, the Data Art Jam was a testament to the power of interdisciplinary collaboration. From quilts that tell stories of resilience, to AI tools that simplify journalism, to art that questions the very nature of data, the event showcased the endless possibilities for reimagining how we communicate information. It left attendees inspired to experiment, adapt, and push the boundaries of what data storytelling can achieve.&lt;/p&gt;

&lt;p&gt;&lt;em&gt;Note: this post was drafted from my personal notes into a narrative using genAI tools and then reviewed and editing by hand&lt;/em&gt;&lt;/p&gt;
</description>
        <pubDate>Tue, 14 Jan 2025 02:00:00 +0000</pubDate>
        <link>https://dataculturegroup.org/2025/01/14/UNL-data-art-jam.html</link>
        <guid isPermaLink="true">https://dataculturegroup.org/2025/01/14/UNL-data-art-jam.html</guid>
        
        
        <category>data</category>
        
        <category>design</category>
        
        <category>visualization</category>
        
        <category>journalism</category>
        
        <category>events</category>
        
        <category>data-literacy</category>
        
        <category>ai</category>
        
      </item>
    
      <item>
        <title>Journalism papers at IEEE Vis 2024: selected highlights</title>
        <description>&lt;p&gt;&lt;a href=&quot;https://ieeevis.org/&quot;&gt;IEEE Vis&lt;/a&gt; is the premiere global academic computation-focused visualization conference. Each year about a thousand people attend, presenting around 100 papers. Some of those touch on data visualization and storytelling in journalism settings, bringing deep technical thinking and research into a domain that connects to the practicing newsroom. In this post I thought I’d summarize a few selected papers I saw at IEEE Vis 2024 that might be relevant to data visualization experts and data storytellers who work in journalism.&lt;/p&gt;

&lt;hr /&gt;

&lt;p&gt;&lt;img src=&quot;https://ieeevis.b-cdn.net/vis_2024/paper_images_small/v-tvcg-20233287585_Image.png&quot; alt=&quot;diagram showing interconnections between journalism and visualization&quot; /&gt;
&lt;em&gt;Structural influce model from &lt;a href=&quot;https://faculty.cc.gatech.edu/~john.stasko/papers/tvcg23-journalism.pdf&quot;&gt;Fu &amp;amp; Stasko, 2024&lt;/a&gt;&lt;/em&gt;&lt;/p&gt;

&lt;p&gt;In “&lt;a href=&quot;https://faculty.cc.gatech.edu/~john.stasko/papers/tvcg23-journalism.pdf&quot;&gt;More Than Data Stories: Broadening the Role of Visualization in Contemporary Journalism&lt;/a&gt;”, Yu Fu and John Stasko identify ways data visualization can expand its impact in journalism beyond storytelling. They helpfullly lay out a research agenda to push other academics to work at this intersection. It might be useful to data journalists to look at their list of newsroom values data visualization can support, and offer some language for working with academics (like me) to develop co-created research projects.&lt;/p&gt;

&lt;hr /&gt;

&lt;p&gt;&lt;img src=&quot;https://ieeevis.b-cdn.net/vis_2024/paper_images_small/w-future-1013_Image.png&quot; alt=&quot;heat map data visualization of flooded areas in Rio Grande do Sul&quot; /&gt;
&lt;em&gt;An example Brazilian news data visualization &lt;a href=&quot;https://ieeevis.b-cdn.net/vis_2024/pdfs/w-future-1013.pdf&quot;&gt;Brito &amp;amp; Ferreira, 2024&lt;/a&gt;&lt;/em&gt;&lt;/p&gt;

&lt;p&gt;In “&lt;a href=&quot;https://ieeevis.b-cdn.net/vis_2024/pdfs/w-future-1013.pdf&quot;&gt;Visual and Data Journalism as Tools for Fighting Climate Change&lt;/a&gt;” authors Emily Brito and Nivan Ferreira use media coverage of the terrible floods in Rio Grande do Sul (Brazil) as a case study in how data journalism can be a tool for fighting disinformation and climate change. Key elements they focus on include pursuasive communication, risk communication, and challenges realted to literacy and misinformation. Their paper offers valuable insights and examples for data journalists working on climate catastrophes.&lt;/p&gt;

&lt;hr /&gt;

&lt;p&gt;&lt;img src=&quot;https://ieeevis.b-cdn.net/vis_2024/paper_images_small/v-tvcg-20243355884_Image.png&quot; alt=&quot;diagram of the protocol used in the study&quot; /&gt;
&lt;em&gt;A visual representation of the protocol used in the study by &lt;a href=&quot;https://arxiv.org/html/2401.05511v1&quot;&gt;Rogha et al, 2024&lt;/a&gt;&lt;/em&gt;&lt;/p&gt;

&lt;p&gt;In “&lt;a href=&quot;https://arxiv.org/html/2401.05511v1&quot;&gt;The Impact of Elicitation and Contrasting Narratives on Engagement, Recall and Attitude Change with News Articles Containing Data Visualization&lt;/a&gt;” authors Milad Rogha , Subham Sah, Alireza Karduni, Douglas Markant, and Wenwen Dou explore the “you draw it” approach to interactive data storytelling. They experimentally explored the impacts of asking readeres for their opinions on attitude change, recall and overall engagement. They found that while showing showing contrasting visual data narratives did increase engagement, they didn’t improve recall nor change attitudes more. This kind of experimental research is critical to understanding how to build more effective data visuals to engage and impact the public.&lt;/p&gt;

&lt;hr /&gt;

&lt;p&gt;&lt;img src=&quot;https://ieeevis.b-cdn.net/vis_2024/paper_images_small/v-full-1446_Image.png&quot; alt=&quot;screenshot of slide from presentation with paper title&quot; /&gt;
&lt;em&gt;Intro slide about &lt;a href=&quot;https://arxiv.org/abs/2408.07483&quot;&gt;Wang et al, 2024&lt;/a&gt;&lt;/em&gt;&lt;/p&gt;

&lt;p&gt;The paper “&lt;a href=&quot;https://arxiv.org/abs/2408.07483&quot;&gt;Visualization Atlases: Explaining and Exploring Complex Topics through Data, Visualization, and Narration&lt;/a&gt;”, by Jinrui Wang, Xinhuan Shu, Benjamin Bach, and Uta Hinrichs, offers a taxonomy of “atlases” of data. This might be particularly useful to data journalists creating exploratory interfaces that let the reader browse through large datasets related to (more explanatory) stories. They found that atlases tend to combine data, narration, and structured navigation to make complex topics accessible. Speficially, many use design templates and careful data curation for consistency and clarity in visual storytelling. Their catalog of these templates and approaches in the paper might help others who are launching projects that create atlases like they describe.&lt;/p&gt;

&lt;p&gt;I’m sure there were others that have relevant findings for folks working with visualizaton in newsrooms, but these were the ones that jumped out to me.&lt;/p&gt;
</description>
        <pubDate>Fri, 03 Jan 2025 05:09:00 +0000</pubDate>
        <link>https://dataculturegroup.org/2025/01/03/ieee-vis-20204-journalism.html</link>
        <guid isPermaLink="true">https://dataculturegroup.org/2025/01/03/ieee-vis-20204-journalism.html</guid>
        
        
        <category>data</category>
        
        <category>design</category>
        
        <category>visualization</category>
        
        <category>journalism</category>
        
      </item>
    
      <item>
        <title>Talk Summary: Dr. Vishwanath on Science Communication &amp; Misinformation</title>
        <description>&lt;p&gt;&lt;em&gt;Notes from a talk By Vish Vishwanath (Harvard School of Public Health; Dana-Farber Cancer Institute), hosted by David Lazar and the &lt;a href=&quot;https://idi.provost.northeastern.edu&quot;&gt;Internet Democracy Initiative&lt;/a&gt;, December 18, 2024.&lt;/em&gt;&lt;/p&gt;

&lt;p&gt;The surge of misinformation in science and health communication is not just a problem of individuals making poor choices or failing to fact-check. It’s a systemic issue deeply embedded in the structures of our modern information ecosystem. In his talk, Vish Vishwanath, chair of the &lt;a href=&quot;https://www.nationalacademies.org/our-work/understanding-and-addressing-misinformation-about-science&quot;&gt;National Academies committee on “Understanding and Addressing Misinformation in Science,”&lt;/a&gt; argued that we need to move beyond individual blame and examine the structural forces that create and amplify misinformation. Here’s a quick summary of how he unpacked the problem—and what we can do about it.&lt;/p&gt;

&lt;p&gt;&lt;img src=&quot;/static/img/posts/vishwanath-speaking.jpg&quot; alt=&quot;speaker in front of projected slide&quot; /&gt;
&lt;em&gt;Vishnawath speaking at the event&lt;/em&gt;&lt;/p&gt;

&lt;h2 id=&quot;the-problem-an-overloaded-information-ecosystem&quot;&gt;The Problem: An Overloaded Information Ecosystem&lt;/h2&gt;

&lt;p&gt;The modern information landscape is overwhelming. With more channels and a greater volume of content than ever before, even science communication and translation experts struggle to make sense of the noise. This crowded ecosystem provides fertile ground for disinformation and misinformation—false information spread with or without malicious intent. Vishwanath broke down the key players shaping today’s science and health communication landscape:&lt;/p&gt;
&lt;ul&gt;
  &lt;li&gt;&lt;strong&gt;Journalists:&lt;/strong&gt; Traditional media outlets, along with local and minoritized press, play a major role in communicating science. The minoritized press, Vishwanath noted, often takes on an advocacy role, contrasting with the gatekeeping tendencies of mainstream media.&lt;/li&gt;
  &lt;li&gt;&lt;strong&gt;Activist groups and government officials:&lt;/strong&gt; These actors produce content to inform, persuade, or advocate, often serving specific agendas.&lt;/li&gt;
  &lt;li&gt;&lt;strong&gt;Social media platforms:&lt;/strong&gt; Platforms like WhatsApp allow anyone to reach massive audiences, often through viral video content. These low-barrier tools can amplify both accurate and false information.&lt;/li&gt;
  &lt;li&gt;&lt;strong&gt;The private sector:&lt;/strong&gt; Commercial industries, like tobacco or sugary beverages, spend billions to shape public perceptions, often promoting narratives that align with their profit motives. For instance, tobacco companies spend $10 billion annually on marketing, creating a communication imbalance compared to public health efforts.&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;Amid this sea of content, making informed choices about what to trust requires effort, which many individuals may not have the time, tools, or skills to devote.&lt;/p&gt;

&lt;p&gt;Misinformation spreads rapidly in this interconnected world, driven by factors such as:&lt;/p&gt;
&lt;ul&gt;
  &lt;li&gt;Social media algorithms that amplify sensational content, regardless of accuracy.&lt;/li&gt;
  &lt;li&gt;The rise of AI-driven deepfakes that blur the line between fact and fabrication.&lt;/li&gt;
  &lt;li&gt;Declining trust in institutions, which leaves people more susceptible to alternative, often incorrect, narratives.&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;One illustrative example Vishwanath shared related to cancer prevention. Despite robust evidence that lifestyle changes can reduce cancer risk, many people believe that cancer is inevitable and that “everything causes it.” These misconceptions are perpetuated by social media spirals, where inaccurate ideas are reinforced and mainstreamed.&lt;/p&gt;

&lt;h2 id=&quot;inequalities-in-health-communication&quot;&gt;Inequalities in Health Communication&lt;/h2&gt;

&lt;p&gt;&lt;img src=&quot;/static/img/posts/vishwanath-model.jpg&quot; alt=&quot;model diagram showing influences on health, communication inequalities, and outcomes&quot; /&gt;
&lt;em&gt;Structural influce model from &lt;a href=&quot;https://www.researchgate.net/publication/357666235_Designing_Effective_eHealth_Interventions_for_Underserved_Groups_Five_Lessons_From_a_Decade_of_eHealth_Intervention_Design_and_Deployment/figures&quot;&gt;a related academic paper&lt;/a&gt; (CC BY 4.0)&lt;/em&gt;&lt;/p&gt;

&lt;p&gt;Inequalities in how people access, interpret, and act on health information are deeply rooted in education, race, and socioeconomic status, with significant consequences for public trust and health outcomes. Education levels and racial identity intersect to shape science literacy, with certain groups, like white evangelical populations, being among the least likely to get vaccinated due to cultural and informational divides. During the COVID-19 pandemic, reliance on online tools for vaccine scheduling highlighted the digital divide, leaving those without internet access at a disadvantage. Marginalized groups are often labeled as “hard to reach,” but Vishwanath reframed this as “hardly reached,” pointing to systemic failures in outreach and inclusion. These data gaps and inequities perpetuate disparities in health communication, reinforcing a cycle of exclusion and misinformation.&lt;/p&gt;

&lt;h2 id=&quot;structural-solutions-rebuilding-trust-through-community&quot;&gt;Structural Solutions: Rebuilding Trust Through Community&lt;/h2&gt;

&lt;p&gt;Vishwanath called for a shift from individual-focused solutions to systemic, community-driven approaches to combat misinformation effectively. By adopting an ecological perspective, efforts should target structural barriers such as limited access to technology and educational disparities, rather than placing the onus solely on individuals to “read critically.” Community-engaged research emerges as a key strategy, involving collaboration with communities to co-define problems and co-develop relevant solutions. Examples of this approach include the MassCONECT Portal in Massachusetts, which provides localized resources and training to help communities adapt public health messages, and SANCHAR in India, which has trained over 200 journalists to critically assess and communicate scientific information. These initiatives illustrate how building structural and community capacity can amplify accurate, evidence-based science communication.&lt;/p&gt;

&lt;p&gt;Ultimately, Vishwanath called for more participatory and inclusive approaches to science communication. Misinformation isn’t just an individual problem—it’s a structural challenge that requires systemic solutions. By leveraging community knowledge and resilience, we can create alternatives to misinformation and foster trust in evidence-based information. Vishwanath’s talk highlighted the urgency of rethinking how we produce, share, and consume science information. Only by addressing the root causes of misinformation can we build a healthier, more equitable information ecosystem.&lt;/p&gt;

&lt;p&gt;&lt;em&gt;Note: this post was drafted from my personal notes into a narrative using genAI tools and then reviewed and editing by hand&lt;/em&gt;&lt;/p&gt;
</description>
        <pubDate>Wed, 18 Dec 2024 05:09:00 +0000</pubDate>
        <link>https://dataculturegroup.org/2024/12/18/vishwanath-health-comms-talk.html</link>
        <guid isPermaLink="true">https://dataculturegroup.org/2024/12/18/vishwanath-health-comms-talk.html</guid>
        
        
        <category>journalism</category>
        
        <category>media-analysis</category>
        
      </item>
    
      <item>
        <title>Reflections on &apos;UN Human Security in the AI Era&apos; Event</title>
        <description>&lt;p&gt;I was honored to attend and speak at a recent UN event about how AI might be reshaping the contours of human security: &lt;a href=&quot;https://hdr.undp.org/content/hdro-host-human-security-30-colloquium-21-november-2024&quot;&gt;&lt;em&gt;Human Security at 30: New Security and Development Horizons in the Artificial Intelligence Age&lt;/em&gt;&lt;/a&gt; (hosted by the UN Human Development Report Office). It was fascinating to bring my experience with participatory AI into this setting and it’s focus on human security, which was new to me. It was even more intriguing to be cast as an “AI expert” in the room of development experts, many of them working on next year’s Human Development Report. I left feeling both buoyed by the amazing work and energy, but also dismayed by the feeling of inevitability in the room, and associated embracing of corporate AI narratives. &lt;strong&gt;New technologies we are calling “AI” pose ongoing and future threats to human security, but &lt;em&gt;we&lt;/em&gt; get to decide who, how, and for what these tools are built and used&lt;/strong&gt;; we can’t cede the space to Big Tech’s corporate cloud solutions. I want to share some reflections on the public morning session in this blog post.&lt;/p&gt;

&lt;p&gt;&lt;img src=&quot;/static/img/posts/un-human-security-1.jpg&quot; alt=&quot;speaking man with short hair standing behind podium in front of banner with event name&quot; /&gt;
&lt;em&gt;Pedro Conceição, head of the UN Development Report Office, introducing the event&lt;/em&gt;&lt;/p&gt;

&lt;p&gt;The event was one in a series of “listening session” style events that inform the production of the annual &lt;a href=&quot;https://hdr.undp.org&quot;&gt;UN Human Development Report&lt;/a&gt;. Many of these center concepts from “human security”, which focuses on protecting individuals (rather than states) from threats such as poverty, inequality, and environmental risks. The idea emerged in the 1990s, driven by definitions from Japan and Canada. Unlike state-level security, which prioritizes borders and military strength, human security emphasizes freedom from fear, want, and indignity, tackling root causes of insecurity, and more. The key idea is that a people-centered approach is essential in addressing global challenges that don’t stop at national boundaries; a category that more and more of our growing challenges fit into, hence the event on human security and AI.&lt;/p&gt;

&lt;h2 id=&quot;existential-risks-in-the-ai-age&quot;&gt;Existential Risks in the AI Age&lt;/h2&gt;

&lt;p&gt;&lt;a href=&quot;https://www.tobyord.com&quot;&gt;Toby Ord&lt;/a&gt;, senior researcher at Oxford University and author of &lt;a href=&quot;https://theprecipice.com&quot;&gt;The Precipice&lt;/a&gt;, kicked off the first panel with a sobering keynote focused on the question of AI as an existential risk to our species. He argued that humanity’s greatest risks stem not from natural forces but from our own tools. Ord stressed that AI development is accelerating far beyond our readiness to manage its consequences. To him, industry leaders and &lt;a href=&quot;https://theelders.org/news/elders-and-future-life-institute-release-open-letter-calling-long-view-leadership-existential&quot;&gt;international figures&lt;/a&gt; have sounded the alarm, yet policy and public discourse lag behind. For Ord, safeguarding the future necessitates treating global AI regulation as a public good—a commitment as enduring and robust as the international frameworks governing nuclear weapons or climate change. I found his commentary to be provocative, because so often I find the “existential threat” narrative around AI to be a corporate smoke-screen that pushes attention away from concrete ongoing harms. I’m not worried about robot overloads controlling my fate; I’m worried about encoded discrimination in decision making, environment harms and exacerbation of power scarcity, impacts of the rampant spread of generated disinformation, and so on.&lt;/p&gt;

&lt;p&gt;&lt;img src=&quot;/static/img/posts/un-human-security-2.jpg&quot; alt=&quot;screenshot of slide defining key elements of human security as noted in the 1994 Human Development Report&quot; /&gt;
&lt;em&gt;Tory Ord’s summary of human security&lt;/em&gt;&lt;/p&gt;

&lt;p&gt;The associated panel of experts highlighted AI’s dual potential to empower and endanger, emphasizing the urgent need for ethical governance and equitable development. Laura Chinchilla, former President of Costa Rica, noted the rise of AI-enabled cybercrimes like phishing and fraud, which have outpaced legal frameworks, and stressed the importance of ethical standards and human rights education to mitigate these harms. Turhan Saleh (UNDP) framed AI within a rapidly evolving global risk landscape, cautioning that while its benefits remain hypothetical and unequally distributed, its costs are already exacerbating inequalities. His arguments strongly reflected some of my own short-term concerns and priorities in this space. Daniella Darlington, a responsible AI advocate, called for investments in data governance, community stewardship, and workforce education in the Global South. She pointed to barriers like limited computational resources, insufficient training, and low public trust—challenges that align with my perspective on addressing present risks rather than speculating on future scenarios.&lt;/p&gt;

&lt;h2 id=&quot;restoring-trust-in-an-age-of-worry&quot;&gt;Restoring Trust in an Age of Worry&lt;/h2&gt;

&lt;p&gt;The second keynote was from &lt;a href=&quot;https://www.cylab.cmu.edu/directory/bios/singh-rita.html&quot;&gt;Prof. Rita Singh (Carnegie Mellon University)&lt;/a&gt;, who took a researcher’s lens to the opportunities and unsolved problems of AI development. Singh works on advanced tools that use voice in numerous ways, including profiling. She called for intentional, human-centered design principles that prioritize societal needs over purely technical ambitions, including calls for society to encode incredibly complex things such as social norms and ethical behaviors into constructs that AI researchers could then implement. I personally found this take quite challenging to engage with, full of language I’d label at first glance as techno-centrist and reductivist. Encode ethical rules in some machine understandable way is incredibly complex and daunting, hardly a simple pre-condition to check-off on the way to building ethical bots.&lt;/p&gt;

&lt;p&gt;Others on the panel shared how AI is increasingly embedded in everyday life, leading to concerns about disinformation, job displacement, and online safety. These feelings of distress and distrust are pervasive, with six out of seven people worldwide reporting insecurity in their lives. The discussion explored how the human security framework, which emphasizes freedom from fear, want, and indignity, could be reimagined to address AI’s challenges and restore trust.&lt;/p&gt;

&lt;p&gt;I was a participant on this panel, asked to address how citizens and community-based initiatives in data collection and technology development might help restore trust and enhance agency. I emphasized that AI’s future is not inevitable—&lt;strong&gt;we have the power to choose what to build&lt;/strong&gt;. By fostering empowerment and ownership in technology development, we can shift the narrative of insecurity toward one of control and agency. Technology systems are inherently socio-technical, and AI is often a marketing term masking this complexity. Centering local expertise and engaging stakeholders in problem definition is crucial for creating public-interest technologies that address meaningful challenges in particpatory and empowering ways.&lt;/p&gt;

&lt;p&gt;I shared examples from the &lt;a href=&quot;/projects/counterdata-network.html&quot;&gt;Counterdata Network&lt;/a&gt;, where we work withcivil society organizations to develop AI tools for monitoring news flows that surface and document human rights violations. This model demonstrates that AI doesn’t need to rely solely on cloud computing solutions in the Global North; it can be localized and aligned with grassroots needs and capacities. I sought to echo and amplify what I learned from my colleague &lt;a href=&quot;https://alessandrajungs.com&quot;&gt;Alessandra Jungs de Almeida&lt;/a&gt; on rethinking “securitization” through feminist perspectives, framing security not as abstract defense but as safeguarding the lives and dignity of women and girls. This reframing, I argued, could profoundly reshape how we approach AI’s role in supporting or threatening human security.&lt;/p&gt;

&lt;h2 id=&quot;towards-equitable-ai-futures&quot;&gt;Towards Equitable AI Futures&lt;/h2&gt;

&lt;p&gt;The event underscored AI’s power to exacerbate global divides if left unregulated. It risks deepening inequalities between the Global North and South, with wealthier nations monopolizing AI’s benefits while poorer nations grapple with its harms. Human security, as defined over the past three decades, must now adapt to include AI’s promise and perils. From empowering women in AI development to addressing the digital gender divide and ensuring equitable access to AI resources, the path forward demands global solidarity and a renewed commitment to ethical governance.&lt;/p&gt;

&lt;p&gt;The juncture that this event focused on offers an opportunity: to redefine security not as freedom from fear alone, but as the ability to thrive in an interconnected, AI-driven world. Will we seize the opportunity to take ownership of who gets to build AI, for which purposes, and with what processes? I was left believing that a human security-inspired approach to AI demands that of us.&lt;/p&gt;
</description>
        <pubDate>Sun, 01 Dec 2024 05:09:00 +0000</pubDate>
        <link>https://dataculturegroup.org/2024/12/01/un-human-security-event.html</link>
        <guid isPermaLink="true">https://dataculturegroup.org/2024/12/01/un-human-security-event.html</guid>
        
        
        <category>ai</category>
        
        <category>events</category>
        
      </item>
    
      <item>
        <title>Sculpting Data with Clay – A New Take on Data Physicalization</title>
        <description>&lt;p&gt;I’m a big fan of sculptural forms of data, particularly because I think they offer novel opportunities to think harder about the &lt;em&gt;material&lt;/em&gt; as an interface to, and representation of, the data. Two projects have popped onto my radar while researching for my forthcoming Community Data book, both related to 3D printing clay data sculptures. “ListeningCups” and “Rain Gauge” transform everyday data into tactile, functional objects. Both projects highlight how clay, as a natural material, can foster an emotional, sustainable, and hands-on interaction with data.&lt;/p&gt;

&lt;h2 id=&quot;listeningcups-tactile-data-stories&quot;&gt;ListeningCups: Tactile Data Stories&lt;/h2&gt;

&lt;p&gt;&lt;img src=&quot;/static/img/posts/clay-sculpture-1.png&quot; alt=&quot;collage of three photos of a 3D printed clay mug in different settings, with bumps on it representing sounds of different parts of the city&quot; /&gt;
&lt;em&gt;Collage showing the ListeningCups, extracted from their academic paper.&lt;/em&gt;&lt;/p&gt;

&lt;p&gt;The &lt;a href=&quot;http://audreydesjardins.com/pdf/Desjardins-ListeningCups-DIS2019.pdf&quot;&gt;ListeningCups project&lt;/a&gt; is the result of a residecy with designers and artists Audrey Desjardins and Timea Tihanyi trying to represent the ambient sounds of city in mug form. Presented at &lt;a href=&quot;https://dis2019.com&quot;&gt;DIS 2019&lt;/a&gt; the paper covers their motivations to bring data closer to people’s lives via tactile approaches that might encourage reflection and personal connection. Physically, they’ve encoded soundscapes in various places around Seattle. Each bump represents a the volume of snippet of ambient sound, allowing users to “feel” the sound landscape of these everyday spaces. This concept of “data stories” showcases how art and data can blend to evoke imagination and memory.&lt;/p&gt;

&lt;p&gt;I particularly like their work here because it allows for casual interaction as an “&lt;a href=&quot;https://link.springer.com/chapter/10.1007/3-540-69706-3_4&quot;&gt;ambient data display&lt;/a&gt;”; you can just use the mug as a mug with an interesting texture, or you could choose to try and interrogate it via tactile and visual ways to dig into the data more. This kind of “craft data sculpture” presenting an intriguing form of bringing people and data together, allowing for more serendipitous data exploration. An intriguing wrinkle is the “data accidents” they point to from the physical printing process, reminding us that data representation isn’t always as exact as mythology and hype tells us. Their reflections particularly highlight their own relation to the object in cool ways, as a talking piece and as a mug. As a drinker of copious amounts of tea, I’d enjoy having one of these around to explore and reflect on.&lt;/p&gt;

&lt;h2 id=&quot;rain-gauge-visualizing-climate-data-with-sustainability-in-mind&quot;&gt;Rain Gauge: Visualizing Climate Data with Sustainability in Mind&lt;/h2&gt;

&lt;p&gt;&lt;img src=&quot;/static/img/posts/clay-sculpture-2.png&quot; alt=&quot;collage of four photos of 3D printed clay mugs with different types of bumps and patterns on them&quot; /&gt;
&lt;em&gt;Collage of test prints evaluating data encoding onto extrusion rate, pauses, and size of symbols&lt;/em&gt;&lt;/p&gt;

&lt;p&gt;Earlier this year a team of University of Minnesota researchers wrote a paper for IEEE Vis 2024 about explorations in creating “ceramic data physicalizations” depicting rainfall levels over the last 80 years – &lt;a href=&quot;https://osf.io/preprints/osf/3nyrq&quot;&gt;the Rain Gauge project&lt;/a&gt;. They had a particular focus on the circular use of the clay material itself. The project practiced sustainable 3D printing by reusing clay repeatedly, making it an eco-friendly alternative to plastics commonly used in 3D printing. This method supports a circular, zero-waste process.&lt;/p&gt;

&lt;p&gt;Their paper explores objects geometries, physical encoding approaches, and logistics constraints of the current clay printing machines they were using. Amusingly they documented the slow decay of the object itself when exposed to rain. Their conclusions focus on the circular use of the clay itself, beyond considerations related to the material as a medium for data sculptures.&lt;/p&gt;

&lt;h2 id=&quot;clay-as-a-medium-for-data&quot;&gt;Clay as a Medium for Data&lt;/h2&gt;

&lt;p&gt;Why clay? Unlike digital screens or synthetic materials, clay is flexible, recyclable, and rooted in a rich history of craft and art.  With each project, the inherent imperfections and natural qualities of clay make data physicalizations not only informative but also deeply personal and reflective of the specific tools used to manufacture them. This innovative use of clay adds a physical and emotional layer to the data that might function to enhance comprehension and personal connection.&lt;/p&gt;

&lt;p&gt;As data becomes more central to our lives, these clay-based projects remind us that data storytelling can be both meaningful and sustainable. Their work reminds me that I’d like to run a study comparing the same data representation in various types of mateiral, to hopefully learn about the affordances and limitations of some of these approaches for varying audiences.&lt;/p&gt;
</description>
        <pubDate>Tue, 19 Nov 2024 05:09:00 +0000</pubDate>
        <link>https://dataculturegroup.org/2024/11/19/clay-data-sculptures.html</link>
        <guid isPermaLink="true">https://dataculturegroup.org/2024/11/19/clay-data-sculptures.html</guid>
        
        
        <category>data</category>
        
      </item>
    
  </channel>
</rss>
