教學大綱 Syllabus

科目名稱:社會科學的計算文字分析

Course Name: Computational Text Analysis for the Social Sciences

修別:選

Type of Credit: Elective

3.0

學分數

Credit(s)

15

預收人數

Number of Students

課程資料Course Details

課程簡介Course Description

Content: The explosion of digital text represents an unprecedented opportunity for social research! This course surveys text pre-processing and analysis on three levels. Topics include:

  1. Pre-Processing: cleaning, pre-processing, exploratory analysis
  2. Analyzing Tokens: vectorization, word-embeddings, tagging and naming
  3. Analyzing Documents: text classification, sentiment analysis
  4. Analyzing Corpora: latent topics, semantic networks

Students will examine how text analysis is used to conduct social science. They will also apply text analysis to social science questions. Special attention will be given to generative AI and the ways it is rapidly augmenting text analysis.    

Audience: Designed as a course for IDAS students, students in IMAS or other graduate programs are also welcome to join. No prerequisites and no prior coding experience are required.  Sample code will be provided. And as your instructor, I will walk you through each exercise, step-by-step. No fear! Let’s start coding!

核心能力分析圖 Core Competence Analysis Chart

能力項目說明


    課程目標與學習成效Course Objectives & Learning Outcomes

    You will become familiar with:

    1. Types of digital text and text analysis
    2. The ways digital text and text analysis are impacting the social sciences

    You will practice:

    1. Implementing computational methods on sample code and text
    2. Communicating your computational analyses and their broader applications
    3. Using generative AI to augment your text analysis

    每周課程進度與作業要求 Course Schedule & Requirements

    教學週次Course Week 彈性補充教學週次Flexible Supplemental Instruction Week 彈性補充教學類別Flexible Supplemental Instruction Type

    Week

    Date

    Topic

    Assignment

    Module 1: Pre-Processing

    1

    2/20

    Introduction to Text & Text Analysis

     

    2

    2/27

    Pre-Processing

     

    3

    3/5

    Exploratory Analysis

    Sun 3/10: Code & Review #1

    4

    3/12

    AI Workshop #1

    Sun 3/17: Team Video #1

    5

    3/19

    Application Roundtable #1

     

    Module 2: Analyzing Tokens

    6

    3/26

    Vectorization

     

    7

    4/2

    Tagging and Naming

    Sun 4/7: Code & Review #2

    8

    4/9

    AI Workshop #2

    Sun 4/14: Team Video #2

    9

    4/16

    Application Roundtable #2

     

    Module 3: Analyzing Documents

    10

    4/23

    Machine Learning for Text Classification

     

    11

    4/30

    Sentiment Analysis

    Sun 5/5: Code & Review #3

    12

    5/7

    AI Workshop #3

    Sun 5/12: Team Video #3

    13

    5/14

    Application Roundtable #3

    Sun 5/19: Poster – Mock-up

    Module 4: Analyzing Corpora

    14

    5/21

    Latent Topics

     

    15

    5/28

    Semantic Networks

    Sun 6/2: Code & Review #4

    16

    6/4

    AI Workshop #4

    Sun 6/9: Team Video #4

    17

    6/11

    Application Roundtable #4

    Sun 6/16: Poster – Printed

    Poster Session – Schedule TBD

    18

    6/18

    No class.

     

    授課方式Teaching Approach

    40%

    講述 Lecture

    30%

    討論 Discussion

    30%

    小組活動 Group activity

    0%

    數位學習 E-learning

    0%

    其他: Others:

    評量工具與策略、評分標準成效Evaluation Criteria

    Grading Rubric

    Task

    Points per Assessment

    Percent of Semester Grade

    Attendance

    After TWO FREE ABSENCES, each unexcused absence is -5%

    0%

    Coding

    4 reviews * 5% each

    20%

    Review

    4 tasks * 5% each

    20%

    Team Videos

    4 videos * 10% each

    40%

    Research Poster

    1 poster & presentation

    20%

     

    Coding Tasks – 20%

    A coding task will be assigned for Modules 1-4. The instructor will provide sample code and data. Students will execute the sample code and make small changes as the task requires.

     

    Research Reviews - 20%

    For each of the four modules, a research review will be assigned. The instructor will specify a computational method or type of data. Students will be asked to find three research papers using that type of method or data, and then to summarize and compare the three papers in a 2-page research review. Students may choose papers in a social science field which interests them.

     

    Team Video – 40%

    For each of the four modules, students will work in teams to complete a text analysis project and make a 10-minute video presenting their results. Each project will expand on that module’s coding task and research review. Students will choose a source of text reflecting a social science application of interest to them. Working in teams, students will be asked to conduct an analysis of the text. Teams may use generative AI tools in order to augment their task. Then they will create a team video, also using AI tools if they wish, to present the results of their analysis.

     

    Research Poster & Poster Session – 20%

    As an end-of-semester project, students will prepare a research poster describing their use of text analysis to answer a social science research question. Posters will have sections on background, the student’s theory and hypothesis, data, analysis, discussion and conclusion. Students will submit a draft of their poster for evaluation, revise the poster based on feedback, print their final poster and present it in an end-of-semester poster session.

     

     

    Generative AI

    Students are encouraged to use generative AI to augment any aspects of all assignments including literature reviews, coding and team videos. If AI-generated results do not fully satisfy assignment criteria, some human intervention may be required in order to complete the assignment and receive full credit.

    指定/參考書目Textbook & References

    References

    • Text Analysis
      • Grimmer, J., Roberts, M. E., & Stewart, B. M. (2022). Text as data: A new framework for machine learning and the social sciences. Princeton University Press.
      • Bengfort, B., Bilbro, R., & Ojeda, T. (2018). Applied text analysis with Python: Enabling language-aware data products with machine learning. " O'Reilly Media, Inc.".
      • Szabó, G., Polatkan, G., Boykin, P. O., & Chalkiopoulos, A. (2018). Social media data mining and analytics. John Wiley & Sons.
    • Machine Learning
      • Jacobucci, R., Grimm, K. J., & Zhang, Z. (2023). Machine Learning for Social and Behavioral Research. Guilford Publications.
      • Beyeler, M. (2017). Machine Learning for OpenCV. Packt Publishing Ltd.
    • Social Network Analysis
      • Borgatti, S. P., Everett, M. G., Johnson, J. C., & Agneessens, F. (2022). Analyzing social networks using R. Sage.
      • Jackson, M. O. (2008). Social and economic networks (Vol. 3). Princeton: Princeton university press.

    已申請之圖書館指定參考書目 圖書館指定參考書查詢 |相關處理要點

    維護智慧財產權,務必使用正版書籍。 Respect Copyright.

    課程相關連結Course Related Links

    Moodle Link TBD

    課程附件Course Attachments

    課程進行中,使用智慧型手機、平板等隨身設備 To Use Smart Devices During the Class

    Yes

    列印