Record Details

COVID-19 Vaccine Discussions on Reddit with Sentiment, Stance, Topics, and Timing

Harvard Dataverse (Africa Rice Center, Bioversity International, CCAFS, CIAT, IFPRI, IRRI and WorldFish)

View Archive Info
 
 
Field Value
 
Title COVID-19 Vaccine Discussions on Reddit with Sentiment, Stance, Topics, and Timing
 
Identifier https://doi.org/10.7910/DVN/XJTBQM
 
Creator Brambilla, Marco
Kharmale, Kalyani
 
Publisher Harvard Dataverse
 
Description This dataset comprises a list of 1,727 Reddit posts and the respective 12,578 comments posted in the CovidVaccine subreddit from April 2020 to May 2021. Each post and comment is enriched with the following aspects:
1) SENTIMENT: subjectivity score (numerical, range [0..1] ), polarity score (numerical, range [-1..1]), and overall sentiment tag (negative, positive, neutral).
2) TOPIC: a set of subtopics is assigned to posts and comments.
3) STANCE: each post and comment is tagged as "in favor", "neutral", or "against" vaccination.
4) ORDERING: each comment is assigned a position and a categorical temporal distance from the original post.

Each dataset is stored as CSV file.
Files include raw data, training and test samples, and results of the predictions according to a specific implementation.
 
Subject Computer and Information Science
Social Sciences
COVID-19
Vaccine
Discourse
Social media
Sentiment Analysis
 
Contributor Brambilla, Marco