Mistake in script Dictionary-Based Text Analysis.Rmd

Thanks a lot for the tutorial, it helped me guide a friend that barely knows the basics of `R` to get started with text analysis!

While going through it, I found a mistake in the ordering of commands in the "Dictionary-based Text Analysis" - you create the factor after assigning the `top_20` variable and then you plot the `top_20` that are obviously not arranged by frequency. I thought you might want to change this:

https://github.com/cbail/textasdata/blob/18ed454221e93b52d57aa9f121d9a027d072b615/dictionary-methods/rmarkdown/Dictionary-Based%20Text%20Analysis.Rmd#L57-L71

to this:

```R
#create factor variable to sort by frequency 
trump_tweet_top_words$word <- factor(trump_tweet_top_words$word, levels = trump_tweet_top_words$word[order(trump_tweet_top_words$n,decreasing=TRUE)]) 

#select only top words 
top_20<-trump_tweet_top_words[1:20,] 

# library(ggplot2) 
# ggplot...
```

	#select only top words
	top_20<-trump_tweet_top_words[1:20,]

	#create factor variable to sort by frequency
	trump_tweet_top_words$word <- factor(trump_tweet_top_words$word, levels = trump_tweet_top_words$word[order(trump_tweet_top_words$n,decreasing=TRUE)])


	library(ggplot2)
	ggplot(top_20, aes(x=word, y=n, fill=word))+
	geom_bar(stat="identity")+
	theme_minimal()+
	theme(axis.text.x = element_text(angle = 90, hjust = 1))+
	ylab("Number of Times Word Appears in Trump's Tweets")+
	xlab("")+
	guides(fill=FALSE)

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Mistake in script Dictionary-Based Text Analysis.Rmd #2

Metadata

Assignees

Labels

Projects

Milestone

Relationships

Development

Mistake in script Dictionary-Based Text Analysis.Rmd #2

Description

Metadata

Metadata

Assignees

Labels

Projects

Milestone

Relationships

Development

Issue actions