All About “COVID-17”

The viewer is made up of several components. See below for an outline of each major section. the viewer
Red: The date selector. Scroll up and down to expose the full 90-day range.

Brown: Contains the header and starter text. The “starter text” is the sub-header CBC articles typically contain. This was the prompt used for the GPT-2 generated text. Scrollable.

Blue: The two articles. Scrollable. Healthcare-related words are automatically highlighted to provide a cursory look at differences.

Purple: Various statistics. Similarity is judged by the positioning of certain words. The sentimentality value (“tone”) is judged by the use of adjectives.

Green: The chart shows the relative tone values for all articles, real and fake, over the full period. The higher the column, the more positive the article. The current date is highlighted in red. The columns are selectable!

Sources

Dai, Tianru. “News Articles.” Version 1.0, Harvard Dataverse, March 2017, https://doi.org/10.7910/DVN/GMFCTR.

Gwern. “GPT-2 Neural Network Poetry.” October 2019, https://www.gwern.net/GPT-2.

Han, Ryan. “COVID-19 News Articles Open Research Dataset.” Version 3.0, Kaggle, May 2020, https://www.kaggle.com/ryanxjhan/cbc-news-coronavirus-articles-march-26.