Skip to content

capstone (for the data and pertained glove file downloading, please check the README.md

Notifications You must be signed in to change notification settings

chenxiaodan105/Congressional_Speeches_Stance_Detection

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

57 Commits
 
 
 
 
 
 

Repository files navigation

Contents

Introduction

Stance detection is the extraction of a people’s reaction to a claim made by a primary actor. It is a core part of a set of approaches to capture political trends. Companies are sensitive to political policies. It could bring them a lot of benefits, such as the fund, the support from the government, if they could catch up with political trends beforehand.

In this project, NLP method is used to do the prediction based on 2 sessions of congressional bills and transcripts.Just explore it and have fun !

How to Use

  • The data and pretrained GloVe file's size is beyond the GitHub's limit, so you could download them through links below
  • Download Data
  • Download pretrained GloVe for word embedding
  • Go to code file and explore!
file use explaination
data_preprocessing.py text preprocessing tokenization, split data, remove stop words, remove special words and so on
relabel_model.py relabel data using regular expression relabel speeches by detecting key words
DL_Models.ipynb Deep Learning Models two models
DL_Models_with_F1_score.ipynb Deep learning Models with the metric F1 score
ML_Models.ipynb ML Models and Imbalance data Handling six models
cutoff_analysiss_for_speech_length.ipynb EDA EDA

About

capstone (for the data and pertained glove file downloading, please check the README.md

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published