Data Science for Social Good

description12 papers

group11 followers

lightbulbAbout this topic

Data Science for Social Good is an interdisciplinary field that applies data analysis, machine learning, and statistical methods to address societal challenges, enhance public welfare, and inform policy decisions. It focuses on leveraging data-driven insights to create positive social impact and improve the quality of life in communities.

lightbulbAbout this topic

Key research themes

1. How can data science education be structured to prepare students effectively for social good applications?

This theme focuses on pedagogical approaches and curricular design in data science programs aimed at equipping students with the interdisciplinary skills, computational tools, and ethical frameworks necessary for tackling complex social challenges. It emphasizes integrating statistical theory, computational proficiency, real-world data complexity, and social context awareness into undergraduate and graduate education to nurture data scientists capable of impactful social good work.

EPSILON - Data Science for Social Good (DSSG) - Notes on the Video Lectures

by Christian Reinboth

2025

Key finding: EPSILON develops modular, flexible open educational resources designed to train diverse learners—from beginners to advanced practitioners—on foundational data science concepts, ethical considerations, project management, and... Read more

articleView Paper downloadDownload

Who Is Doing Computational Social Science? Trends in Big Data Research

by Katie Metzler

2017

Key finding: A large-scale survey reveals that although a significant portion of social scientists engage with big data research, many face barriers such as lack of programming skills and interdisciplinary collaboration experience. The... Read more

articleView Paper downloadDownload

keyboard_arrow_downShow more

2. What are the ethical frameworks and socio-technical challenges in applying data science to social good initiatives?

This theme examines the intersection of data science practices with ethical considerations, privacy concerns, and socio-technical system design to responsibly harness data for social good. It explores frameworks that integrate legal guidelines, public trust, and ethical principles to guide data projects, especially in government and population-level research, while acknowledging the balance between individual privacy and collective benefit.

A Position Statement on Population Data Science

by Ros Moran

2021, International Journal of Population Data Science

Key finding: Defines Population Data Science as the science of data about people and highlights critical challenges including balancing individual privacy with the public good and developing robust socio-technical systems. The paper... Read more

articleView Paper downloadDownload

What Does Information Science Offer for Data Science Research?: A Review of Data and Information Ethics Literature

by Brady D Lund

2024, Journal of Data and Information Science

Key finding: Provides a narrative review illustrating how information science contributes a humanistic, transdisciplinary perspective on data ethics, emphasizing bias, anti-discrimination, and professional codes. It presents information... Read more

articleView Paper downloadDownload

keyboard_arrow_downShow more

3. How are Data for Good programs designed and implemented within academic and community partnerships to effectively address social challenges?

This theme investigates the structure, operational models, and collaborative practices of university-hosted and community-based Data for Good initiatives. It highlights the management of interdisciplinary teams, project lifecycle considerations, partnerships with nonprofits and public organizations, and the translation of data science methods into actionable insights that advance social welfare and equity.

The Data for Good Growth Map: Decision Points for Designing a University-Based Data for Good Program

by Dharma Dailey and

2022, University of Washington eScience Institute

Key finding: Analyzes experiences from multiple international university Data for Good programs, identifying critical design decisions regarding program mission, student engagement, partner relations, and ethical considerations. The paper... Read more

articleView Paper downloadDownload

Care and the Practice of Data Science for Social Good

by Amanda H Meng and

2019, Proceedings of the 1st ACM SIGCAS Conference on Computing and Sustainable Societies

Key finding: Proposes integrating a 'logic of care' into Data Science for Social Good practices, based on empirical research with a community group advocating for affordable housing. It shows how care-oriented approaches across project... Read more

articleView Paper downloadDownload

Data Projects For "Social Good": Challenges And Opportunities

by Karsten Tolle

2023

Key finding: Surveys challenges unique to social good data projects such as ethical handling of sensitive data, stakeholder motivation, and cultural-political barriers, contrasting them with corporate data projects. It identifies... Read more

articleView Paper downloadDownload

keyboard_arrow_downShow more

All papers in Data Science for Social Good

Data for Good - La science des données au service des publics minorisés et des milieux communautaires

by Nadia Seraiocco

2026, Data for Good La science des données au service des publics minorisés et des milieux communautaires

descriptionView Paper arrow_downwardDownload

Harnessing Social Media for Real-Time Disease Outbreak Prediction: A Digital Epidemiology Approach

by Olumuyiwa Ayomide Olorunfemi

2025

The traditional epidemiological surveillance systems have proven effective over time but face significant delays due to dependency on hospitals, laboratories, and government databases. Moreover, underreporting and slow data collection... more

descriptionView Paper arrow_downwardDownload

EPSILON - Data Science for Social Good (DSSG) - Notes on the Video Lectures

by Christian Reinboth

2025

The EPSILON project - European Platform for Social Data Science Incubation, Learning, Operation and Network - aims to bridge the gap between data science and social good by fostering impactful, research-driven initiatives. Co-funded by... more

descriptionView Paper arrow_downwardDownload

Towards better social crisis data with HERMES: Hybrid sensing for EmeRgency ManagEment System

by Maurizio Tesconi

2025, Pervasive and Mobile Computing

People involved in mass emergencies increasingly publish information-rich contents in online social networks (OSNs), thus acting as a distributed and resilient network of human sensors. In this work, we present HERMES, a system designed... more

descriptionView Paper arrow_downwardDownload

Hybrid Crowdsensing

by Maurizio Tesconi

2025

Crowdsensing systems can be either participatory or opportunistic, depending on whether the user intentionally contributes data, or she simply acts as the bearer of a sensing device from which data is transparently collected. In this... more

descriptionView Paper arrow_downwardDownload

On the need of opening up crowdsourced emergency management systems

by Maurizio Tesconi

2025, AI & society

Nowadays, social media analysis systems are feeding on user contributed data, either for beneficial purposes, such as emergency management, or for user profiling and mass surveillance. Here, we carry out a discussion about the power and... more

descriptionView Paper arrow_downwardDownload

Hybrid Crowdsensing

by Maurizio Tesconi

2025, Proceedings of the 26th International Conference on World Wide Web Companion - WWW '17 Companion

descriptionView Paper arrow_downwardDownload

Measuring objective and subjective well-being: dimensions and data sources

by Maurizio Tesconi

2025, International Journal of Data Science and Analytics

Well-being is an important value for people’s lives, and it could be considered as an index of societal progress. Researchers have suggested two main approaches for the overall measurement of well-being, the objective and the subjective... more

descriptionView Paper arrow_downwardDownload

On the need of opening up crowdsourced emergency management systems

by Maurizio Tesconi

2025, AI & SOCIETY

descriptionView Paper arrow_downwardDownload

Predictability or Early Warning: Using Social Media in Modern Emergency Response

by Maurizio Tesconi

2025, IEEE Internet Computing

descriptionView Paper arrow_downwardDownload

The Mobile Territorial Lab: a multilayered and dynamic view on parents’ daily lives

by Chiara Leonardi

2024, EPJ Data Science

The exploration of people's everyday life has long been of interest to social scientists. Recent years have witnessed a growing interest in analyzing human behavioral data generated by technology (e.g. mobile phones). To date, a few... more

descriptionView Paper arrow_downwardDownload

Batman or the joker?

by Chang-Tien Lu

2024, SIGSPATIAL Special

The exponential growth of the urban data generated by urban sensors, government reports, and crowd-sourcing services endorses the rapid development of urban computing and spatial data mining technologies. Easier accessibility to such... more

Figure 1: Spatial Distributions of Different Categories of Crime With the ubiquitous deployment of the urban sensors and rapid growth of crowdsourcing technologies, urban computing and spatial data mining techniques begin to thrive in detecting and analyzing urban events. Among all categories of urban events, safety and security-related events should be treated as one of the most important ones without a doubt. The urban computing community has addressed important problems such as urban safety and crime prediction [6, 26, 9], safe route recommendations [33, 12], and threats detection [21]. However, the convenience and accessibility of such abundant urban and spatial data generated by the urban sensors, end- users, and city administrators put a spotlight on unethical issues such as biased datasets, biased algorithms, biased results, and compromised privacy. Such problems are rarely addressed by the researchers in the urban safety analysis fields. In this section, we summarize some of the pioneering research works in the urban safety analysis field, address the potential ethical issues, and then provide our visions on how to tackle and improve or mitigate the current research status of the ethical issues in the urban computing and spatial data mining fields.

depending on gender and age demographics. Urban recommendations could be given to any specific city if the data were available. Figure 1 shows the correlation between the crime distribution and the physical appearances of the city. In another study to explore the connection between urban perception and crime inferences, Liu et al. [26] present a unified framework to learn to quantify safety attributes of physical urban environments using crowd-sourced street-view photos without human annotations. A large-scale urban image dataset is collected in multiple major cities. Safety scores from the government’s criminal records are collected as objective safety indicators. A deep convolutional neural network is proposed to parameterize the instance-level scoring function. Figure 2 shows the structure of the proposed model. The method is capable of localizing interesting images and image regions for each place.

Silver line. Disruption 6, disruption 7 and disruption 8 occurred on the Blue line. MDDM model successfully detects disruptions 1, 4 and 6. Figure 3: A timeline of metro disruptions on the Orange, Blue, and Silver metro lines in 2015. Events along these spatially interconnected lines often co-occur Traffic Incident Detection Analysis with Social Media Summarization. Fu et al. [11] propose a social media-based traffic status monitoring system (Steds). The system is initiated by a transportation-related keyword generation process. Then an association rules based iterative query expansion algorithm is applied to extract real-time transportation-related tweets for incident management purposes. The feasibility of summarizing the redundant tweets to generate concise and comprehensible textual contents is confirmed.

descriptionView Paper arrow_downwardDownload

Segregated interactions in urban and online space

by Alfredo Morales

2023, EPJ Data Science

Urban income segregation is a widespread phenomenon that challenges societies across the globe. Classical studies on segregation have largely focused on the geographic distribution of residential neighborhoods rather than on patterns of... more

descriptionView Paper arrow_downwardDownload

Predictability or Early Warning: Using Social Media in Modern Emergency Response

by Andrea Marchetti

2023, IEEE Internet Computing

descriptionView Paper arrow_downwardDownload

Predictability or Early Warning: Using Social Media in Modern Emergency Response

by Carlo Meletti

2023, IEEE Internet Computing

descriptionView Paper arrow_downwardDownload

Data science for urban equity: Making gentrification an accessible topic for data scientists, policymakers, and the community

by Rachel Berney

2023, arXiv (Cornell University)

Figure 1: One view of the Equitable Futures visualization tool.

descriptionView Paper arrow_downwardDownload

Mobile phone data and geospatial information for official statistics and indicators

by Kenneth Iversen

2023

The advent of Big Data is having an important impact on the production and analysis of data, and is changing the environment within which the official statistical community operates. Spurred by the increased demands for timely and... more

descriptionView Paper arrow_downwardDownload

Data science for urban equity: Making gentrification an accessible topic for data scientists, policymakers, and the community

by Gundula Proksch

2023, arXiv (Cornell University)

descriptionView Paper arrow_downwardDownload

Visualizing Equity : A Data Science for Social Good Tool and Model for Seattle

by Gundula Proksch

2023

Our paper presents the products, preliminary findings, and methodology of the Equitable Futures project, an investigation into the active gentrification process and increasingly inequitable access to opportunities in Seattle. The project... more

descriptionView Paper arrow_downwardDownload

Mobile phone data and geospatial information for official statistics and indicators

by Ivo Havinga

2023

descriptionView Paper arrow_downwardDownload

1 Big Data and Happiness

by Stephanié Rossouw

2022

The pursuit of happiness. What does that mean? Perhaps a more prominent question to ask is, 'how does one know whether people have succeeded in their pursuit'? Survey data, thus far, has served us well in determining where people... more

descriptionView Paper arrow_downwardDownload

A case-study in cross-disciplinary student work: a CNC-manufactured body for FSAE racing

by Emmanouil Vermisso

2022

descriptionView Paper arrow_downwardDownload

One Project at a Time: Service and Learning Applied in Appalachian Communities

by D. Jason Miller

2022, Intersections Between the Academy and Practice

Against a geographically isolated and economically — if not culturally — impoverished backdrop, brothers B.B. and D.D. Dougherty founded Watauga Academy, a forerunner of Appalachian State University, in 1899 to provide educational... more

descriptionView Paper arrow_downwardDownload

Data science for urban equity: Making gentrification an accessible topic for data scientists, policymakers, and the community

by Gundula Proksch

2022, ArXiv

The University of Washington eScience Institute runs an annual Data Science for Social Good (DSSG) program that selects four projects each year to train students from a wide range of disciplines while helping community members execute... more

descriptionView Paper arrow_downwardDownload

Visualizing Equity: Learning from “Data Science for Social Good” in the Built Environment

by Gundula Proksch

2022, 106th ACSA Annual Meeting Proceedings, The Ethical Imperative

descriptionView Paper arrow_downwardDownload

Baroque Rome as Algorithm: Coding History

by Laura Terry

2022, 107th ACSA Annual Meeting Proceedings, Black Box

descriptionView Paper arrow_downwardDownload

Visualizing Equity: Learning from “Data Science for Social Good” in the Built Environment

by Rachel Berney

2022, 106th ACSA Annual Meeting Proceedings, The Ethical Imperative

descriptionView Paper arrow_downwardDownload

Predictability or Early Warning: Using Social Media in Modern Emergency Response

by Andrea Marchetti

2022, IEEE Internet Computing

descriptionView Paper arrow_downwardDownload

The Data for Good Growth Map: Decision Points for Designing a University-Based Data for Good Program

by Dharma Dailey and

2022, University of Washington eScience Institute

University-hosted Data for Good (D4G) programs provide research, service, and learning opportunities to students through team-based projects run outside of normal course offerings (typically during the summer). Popular with students, D4G... more

descriptionView Paper arrow_downwardDownload

Visualizing Equity : A Data Science for Social Good Tool and Model for Seattle

by Rachel Berney

2022

descriptionView Paper arrow_downwardDownload

Visualizing Equity: Learning from “Data Science for Social Good” in the Built Environment

by Rachel Berney

2022, 106th ACSA Annual Meeting Proceedings, The Ethical Imperative

and across the thematic cluster, second, it provided the team with a testable answer for which indicators were the most useful (i.e. most strongly correlated with one another) in answering questions about equity, and, third, with the model undergirding the web-based tool that lies at the heart of the project, stakeholders are able to run predictions by changing inputs on select criteria. In addition, they are able to view the results visually. Through these results, the team responded to all of its goals for the project. First, the project functions as a planning, education, research, and decision-making resource for designers, planner, nonprofit organizations, and research- ers. In reviewing similar reporting and decision-making tools the literature review found that those earlier projects were comparatively limited, displaying only static maps and/or mapping at single fixed spatial scale. Second, the project sup- ports user interactions with data at multiple scales across a variety of topic areas in spatial form. This will help users to probe data quality and look for bias in the data used. Third, the project supports analysis on the city level, as well as pro- vides the ability to zoom into census tract, neighborhood, and census block scales. The tool displays publicly available data at all of these scales primarily related to the team’s themes of housing, business and development, development, educa- tion, environment, health, income, and mobility (figure 4). The tool is capable of visually displaying analysis from a struc- tural equation model, including synthesized outcomes across

LESSONS LEARNED Figure 4: Structural equation model, (a) schematic diagram and (b) display of model score at the neighborhood scale

Equity Modeler assumptions and limitation of the project. In the case of the project described in this paper, transparency and replicabil- ity were at the forefront of the team’s concerns for creating a socially-responsible tool. DSSG usually develops tools for the supported organizations to empower them in their fur- ther data use. The mandate of transparency and use of open source software make these tools also available for other groups to adapt and use. Figure 5: Web-based tool, display of points of interest and geographical features on neighborhood scale

descriptionView Paper arrow_downwardDownload

Data science for urban equity: Making gentrification an accessible topic for data scientists, policymakers, and the community

by Rachel Berney

2022, ArXiv

descriptionView Paper arrow_downwardDownload

Public Space: Activation v. De-Activation

by Clifton Ellis

2022

Historically, the great cities of the world have built public spaces that have often been used as venues for spectacle, and displays of power and status. These public venues are part of the identity of these cities and have an importance... more

descriptionView Paper arrow_downwardDownload

Airscapes: Atmosphere as Form in Architecture/ No Molds but Modulators

by ophelia mantz

2022, New Instrumentalities

Atmosphere, atmospheric, or atmotopo attempts to capture a crucial cultural moment that weaves together different schemes of thought with myriad technologies of communication and visualization. The methods of representation are arguably... more

descriptionView Paper arrow_downwardDownload

The Campus as a Living Laboratory: Post-Occupancy Evaluation and a Digital Repository as a Teaching Tool

by Stacey White

2022, Intersections Between the Academy and Practice

In 2013-14, the California State University system funded 23 grants on 14 campuses in an effort to spur innovation in sustainability. The funding for these grants came from leveraging $250,000 of system-wide resources slated for energy... more

descriptionView Paper arrow_downwardDownload

Exploring Sentiment in Social Media and Official Statistics: a General Framework

by Manuela Sanguinetti

2022

The integration between official statistics and social media data is a challenging topic. This contribution aims to present a recently-designed framework to compare sentiment analysis on social media content with social and economic data.... more

descriptionView Paper arrow_downwardDownload

A comparison of spatial-based targeted disease mitigation strategies using mobile phone data

by Stefania Rubrichi

2022, EPJ Data Science

Epidemic outbreaks are an important healthcare challenge, especially in developing countries where they represent one of the major causes of mortality. Approaches that can rapidly target subpopulations for surveillance and control are... more

descriptionView Paper arrow_downwardDownload

1 Big Data and Happiness

by Talita Greyling

2022

Figure 8: Emotions of South Africans before and after the enforcement of lockdown regulations. motions changed drastically, though differing in levels from one country to the other. The prominent emotions The emotions most noted across all three countries before COVID-19 were joy, trust and anticipation, but after the

Zealanders with the death of a celebrity.

Source: Dodds et al. (2011). Note: The scale is from 1 extremely negative to 9 extremely positive. levels increase to peak on Saturdays, slightly above the score for Fridays (see figure 3). Regarding the average happiness levels per day of the week, the research team found that Tuesdays, against

Figure 4: South Africa's intraday (hourly) happiness levels during the Rugby World Cup 2019. of a whole nation. Africa, since launching the index in April 2019. This illustrates the power of sporting events to influence the mood

Figure 1: Automatic sentiment analysis system

figure 2 for a representation of the Hedonometer for 2019 up to the day of writing this paper. To construct the Hedonometer, they bin all the tweets extracted on a day; however, they include only those words

Source: Greyling et al. (2019). their new way of living after a major shock. Notes: "Lockdown day" is the first day that regulations came into force for people to isolate socially in different countries. The "average" indicates the average happiness levels in the respective countries before COVID-19. Africa were nearly back to their normal happiness levels, suggesting the incredible ability of people to adjust to

In terms of a societal trauma, the researchers considered New Zealand's reaction to the American professional

Source: Adapted from Sumner (2003). Table 1: Evolution of the concept of human well-being (1950-2015). 3 Tt falls outside of the scope of the paper to discuss the social indicators movement, but we refer readers to the work done by Verlet and Devos (2009) and Diener and Suh (1997) for additional information. how researchers understand and measure human well-being (Table 1).

Source: Dodds et al. (2011). Table 2: Hedonometer, word scores on a 9-point scale of happiness. Once they have derived the happiness values of each word, they use these values to construct the Hedonometer.

descriptionView Paper arrow_downwardDownload

Visualizing Equity : A Data Science for Social Good Tool and Model for Seattle

by Gundula Proksch

2022

descriptionView Paper arrow_downwardDownload

Dengue surveillance based on a computational model of spatio-temporal locality of Twitter

by Adriano Veloso

2022, Proceedings of the 3rd International Web Science Conference on - WebSci '11

Twitter is a unique social media channel, in the sense that users discuss and talk about the most diverse topics, including their health conditions. In this paper we analyze how Dengue epidemic is reflected on Twitter and to what extent... more

descriptionView Paper arrow_downwardDownload

Dengue surveillance based on a computational model of spatio-temporal locality of Twitter

by Adriano Veloso

2022

descriptionView Paper arrow_downwardDownload

Data science for urban equity: Making gentrification an accessible topic for data scientists, policymakers, and the community

by Rachel Berney

2022, Bloomberg Data for Good Exchange Conference

descriptionView Paper arrow_downwardDownload

Geographical veracity of indicators derived from mobile phone data

by Zbigniew Smoreda

2022

In this contribution we summarize insights on the geographical veracity of using mobile phone data to create (statistical) indicators. We focus on problems that persist with spatial allocation, spatial delineation and spatial aggregation... more

descriptionView Paper arrow_downwardDownload

Visualizing Equity: Learning from 'Data Science for Social Good

by Rachel Berney

2021, Proceedings of the 106th ACSA Annual Meeting

Data science has developed a culture of “data science for social good,” or DSSG, to address the ethical dilemma that their work and innovations benefit primarily the corporate and investment sectors. DSSG programs provide data analysis to... more

descriptionView Paper arrow_downwardDownload

Compendio Estadístico del Sector no Lucrativo en México 2021

by Romina Farías Pelayo

2021, Compendio estadístico del sector no lucrativo en México 2021

Esta publicación contiene una estructura informativa y datos estadísticos sobre las organizaciones sin fines de lucro en México hasta 2021. Contiene también datos que dimensionan algunas de las problemáticas actuales a las que se... more

descriptionView Paper arrow_downwardDownload

Misery loves company: happiness and communication in the city

by maryam almehrezi

2021, EPJ Data Science

The high population density in cities confers many advantages, including improved social interaction and information exchange. However, it is often argued that urban living comes at the expense of reducing happiness. The goal of this... more

descriptionView Paper arrow_downwardDownload

Predictability or Early Warning: Using Social Media in Modern Emergency Response

by Carlo Meletti

2021, IEEE Internet Computing

descriptionView Paper arrow_downwardDownload

DOT: a crowdsourcing Mobile application for disease outbreak detection and surveillance in Mauritius

by Lownish Sookha

2021, Health and Technology

Early detection of disease outbreaks is crucial and even small improvements in detection can significantly impact on a country's public health. In this work, we investigate the use of a crowdsourcing application and a real-time disease... more

descriptionView Paper arrow_downwardDownload

DOT: a crowdsourcing Mobile application for disease outbreak detection and surveillance in Mauritius

by Lownish Sookha

2021, Health and Technology

descriptionView Paper arrow_downwardDownload