“Am I the Asshole” is a popular thread on Reddit where users describe situations and ask other users to vote whether the original poster was the asshole in the situation described in the post (yes, the asshole - YTA) or not (not the asshole – NTA). The aim of this blog post is to analyze a corpus of YTA vs NTA texts and determine whether there are certain linguistic markers that could indicate whether a post will be voted as YTA or NTA.Looking for lexical bundles in a corpus can be a way of finding meaningful patterns that tell us something about the way in which people use language – in this case, who is the asshole and who is not. In this short study, we utilized two corpora, one a collection of Reddit 300 posts where the poster is deemed “the asshole” by their peers, and the other 300 posts where the poster is “not the asshole”.The objective is to extract n-grams using AntConc to see if there is any language specific to the YTA posts compared to the language of those individuals considered to be ‘less assholeish’. After experimenting with the parameters available in AntConc, we decided to search each corpus for case-insensitive 3-word n-grams. N-gram parameters were set to a minimum frequency of 20 and a minimum range of 10 files across both corpora in order for the lexical bundles to be more representative of the sample corpora.The following table depicts the top 20 results for comparison between the YTA and the NTA corpora.