CS 4501 / 6501 Analyzing Online Behavior for Public Health

Prof. Henry Kautz
<henry.kautz@virginia.edu>

Monday & Wednesday 3:30-4:45pm
Mechanical Engr Bldg 341

 

Description

People’s online behavior contains signals about their physical and mental health. This course will explore research on using data from users’ interactions with Twitter/X, Google Search, YouTube and other online platforms for tasks ranging from identifying people suffering from anxiety disorder to tracking down restaurants that are sources of food poisoning. We will also read papers on both sides of the ongoing debate about whether social media should be restricted because of potential harm to children or adults.

Course Calendar

Please visit this Google sheet for the the calendar of readings and presentations.

Prerequisites

CS 2100 or permission of the instructor.

Readings and Presentations

Students will be required to read up to 4 research papers each week and write a 1 page summary of each. These summaries should be written manually by the students without using any AI writing or summarization tools. They are due on the day the paper is discussed in class.

In addition, pairs of students will be assigned to present the papers and lead class discussions. The pair of students should work together to create a single Powerpoint (or other presentation software) presentation of about 25 minutes duration. They should strive to thoroughly understand the paper, which may involve reading others papers cited by the paper being presented or learning about the machine learning algorithms the authors used so they can explain them to the class. Students should contact each other at least a week before their presentation to make sure they have time to meet. They should practice the presentation before giving it to the class.

Presenters are responsible for checking the class calendar to know the date they have been assigned. A student who will not be able to attend class on their assigned date should inform the instructor as soon as possible so that they can be assigned to a different date. In addition, if a student decide to drop the class before their presentation, they should inform the instructor as soon as that decision is made so that he can revise the calendar.

The written summaries and presentations should include

Programming Projects

For the course programming projects, students will download and analyze part of their own online footprint. The easiest data users can access is their Google Takeout data, which includes their browsing history, location history, search history, and YouTube viewing history.

Students who already use Google services should consider turning on the “Timeline” feature of Google Maps immediately upon registering for the course (if it is not already on) in order to ensure that they have a rich set of location data to analyze. Students who do not use Google services should obtain some other kind of mapping software that will allow them to download their movement history.

You are encouraged to make use of large language models such as ChatGPT for help coding. You are also free to make use of public code respositories. All such use of LLMs or public repositories must be cited in your report.

Both CS 4501 and CS 6501 students will complete the first two projects:

Only CS 6501 is required to complete a third project; it is optional for CS 4501 students. Students may work on the project alone or in partnership with another student. If you decide to partner with another student, please be sure that you are comfortable sharing your data with that person. You are by no means compelled to share your data with anyone else in order to complete this course! Teams of three students might also be possible, but you should obtain permission from the instructor in advance, and the scale and depth of your project should be larger than the minimum requirements.

Academic Honesty

Using AI tools to create any paper summaries will be taken as academic dishonesty. Passing off someone else’s work as your own without acknowledgement will be taken as academic dishonesty. All cases of suspected academic dishonesty will be referred to the UVA Honor Office.

Grading

Grades will be based on