SOCIAL DATA SCIENCE - 2024/5

Module code: SOCM081

Module Overview

We live in a world where large quantities of data are regularly collected about people, institutions, and social structures. This module will enable students to understand how data science techniques can be used to leverage this data and answer complex questions about the social world. Questions like ‘why some people are more at risk of crime than others?’, ‘what explains differences in life expectancy between countries?’, and ‘do gender inequalities persist in the workplace’.

 

Throughout the module, students will focus on the underlying principles and uses of some of the core data science approaches rather than on the mathematical and statistical theory. Students will therefore develop a solid empirical grounding to be able to critically evaluate the findings from a wide range of quantitative social science research. In the accompanying workshops students will get hands on experience with a number of different social data science approaches in R covering regression approaches for theory testing, machine learning and prediction models, and matching algorithms for quasi-experimental designs. Students will also get a good understanding of how to interpret the results from these approaches and how they can be applied in practice.

Module provider

Sociology

Module Leader

BRUNTON-SMITH Ian (Sociology)

Number of Credits: 15

ECTS Credits: 7.5

Framework: FHEQ Level 7

Module cap (Maximum number of students): N/A

Overall student workload

Workshop Hours: 11

Independent Learning Hours: 106

Lecture Hours: 11

Guided Learning: 11

Captured Content: 11

Module Availability

Semester 2

Prerequisites / Co-requisites

Students must have completed SOCM064 (Social data analytics), or be able to demonstrate that they have received an equivalent level of quantitative methods training

Module content

This module elaborates on social science approaches to data science, combining this with practical model building experience and critique using R. Indicative content includes:

 


  • Designing and building data science models to answer social science questions

  • Querying generative AI chatbots for code enhancements, model troubleshooting, and synthetic data generation.

  • Model testing and cross-validation

  • Sensitivity analyses

  • Interpreting results and finding the narrative



 

Hands on practical workshops will provide students with experience of:

 


  • Generalised Linear Models

  • Machine Learning algorithms

  • Matching approaches and treatment effect estimation


Assessment pattern

Assessment type Unit of assessment Weighting
Coursework Analytic Report 100

Alternative Assessment

N/A

Assessment Strategy

The assessment strategy is designed to provide students with the opportunity to demonstrate:

A detailed understanding of how to use data science approaches to answer questions about the social world. Using real data, students will apply a number of different techniques (regression, machine learning, matching) requiring them to make appropriate decisions based on particular data constraints, and assess the robustness of models. They will also demonstrate their ability to interpret the findings from these models, and discuss their implications.

Thus, the summative assessment for this module consists of:

An analytic report (4,000 words) where students will conduct a data science analysis and write up their results. 

Formative assessment and feedback 

Students will be able to complete workshop exercises during the module, giving them the chance to discuss the answers with each other and the lecturer. They will also have the opportunity to discuss their assignment plans in detail with the lecturer and will need to provide an initial scoping plan for feedback prior to completing the assignments. The assignment is designed to guide students through the research process and so it is also expected that feedback on the research approach and analytic work will be provided throughout the process. Students will also be encouraged to engage with generative AI chatbots to help refine model code for their analysis.

Module aims

  • Provide students with a clear description, definition of, and discussion of some of the most popular approaches to social data science
  • Enable students to critically evaluate the full range of quantitative research conducted in the social sciences
  • Provide students with hands on experience of data science approaches suitable for a range of situations including theory testing, prediction, and causal effects modelling using quasi-experimental design
  • Understand how new developments in generative AI can be leveraged to form a central part of the social data science workflow

Learning outcomes

Attributes Developed
001 Have a critical awareness of the rationale and terminology of social data science approaches K
002 Be able to engage with existing quantitative research, highlighting its key strengths and weaknesses KT
003 Have a comprehensive understanding of the logic of model development and theory testing CP
004 Be able to develop prediction models using machine learning algorithms and test their validity CP
005 Be able to generate plausible control and treatment groups for testing causal effects using quasi-experimental matching approaches CP

Attributes Developed

C - Cognitive/analytical

K - Subject knowledge

T - Transferable skills

P - Professional/Practical skills

Methods of Teaching / Learning

The learning and teaching strategy is designed to:

 

Provide students with a working knowledge of data science techniques to deal with a range of different data scenarios. Alongside theoretical knowledge, workshops will give students practical experience of different techniques, and how they should be applied. Students will also learn how to critically assess their models.

 

The learning and teaching methods include:

 


  • Lectures

  • Practical workshops in R

  • Group discussion and feedback



 

This module is taught in weekly two-hour sessions. Each session will consist of a combination of lectures and hands on practical sessions with the statistics package R (www.r-project.org).

Indicated Lecture Hours (which may also include seminars, tutorials, workshops and other contact time) are approximate and may include in-class tests where one or more of these are an assessment on the module. In-class tests are scheduled/organised separately to taught content and will be published on to student personal timetables, where they apply to taken modules, as soon as they are finalised by central administration. This will usually be after the initial publication of the teaching timetable for the relevant semester.

Reading list

https://readinglists.surrey.ac.uk
Upon accessing the reading list, please search for the module using the module code: SOCM081

Other information

The Department of Sociology is committed to developing graduates with strengths in digital capabilities, employability, global and cultural capabilities, resourcefulness and resilience and sustainability. The MSc programme in Social Research specifically develops these strengths with a view to preparing graduates for careers in social research. This module aims to develop students’ grasp of a set of key skills and awareness to underpin their future research. In particular, it supports students to develop in the following key areas.

 

Digital capabilities. Use of appropriate software and access to digital resources are essential for the contemporary social researcher. This module supports students to develop skills and confidence in the use of the widely used software R and some of the key data science approaches.

 

Employability. The module aims to develop skills that are highly significant in the workplace for a social researcher, with a view to supporting students to be able effectively to use quantitative analysis to inform real world problems. Data science is a clear area of growth in the UK economy and students from this module will be well equipped to contribute in this area.

 

Resourcefulness and resilience. Students are encouraged to see themselves as engaged in a process of continual development of their skills and as needing to keep abreast of emerging developments. Data science techniques often go hand in hand with new developments in AI and students will be encouraged to integrate these tools (e.g. ChatGPT) into their programming workflow.

Please note that the information detailed within this record is accurate at the time of publishing and may be subject to change. This record contains information for the most up to date version of the programme / module for the 2024/5 academic year.