Results of Topic Modeling

Topic modeling in class proved to be methodically interesting and useful for a method of natural language processing and figuring out the statistical settings in the words to conclude explicit topics from them. This novel tool aided in “highlighting” key words that were highly significant and repetitive in the Sherlock Holmes stories.

One of the topics that I had generalized consisted of words that describe a place or a setting such as, “room door window light open long passed house stood round opened front dark night entered”. From these words, it was quite evident that the program was trying to distinguish an environment when an event had occurred. Another unique set of words that were distinguished and brought to my attention were, “woman lady wife husband life love left boy child nature loved beautiful maid ferguson happy madam women mistress devoted wonderful”. I characterized it with the title “Family” due to the concepts of love and relationships described with the words. Another set of words were, “man lay dead poor body professor blood close carried end terrible death struck moment broken shot strange deep long water” which depicted a negative connotation with pessimistic adjectives. I characterized this set of words with “Depression” as they were very sorrowful and lonely descriptive words. A set of words that described economic value and worth in the Sherlock Holmes stories were, ” business client money england hundred king pounds thousand large set gold photograph paid pay ten give draw fifty ready worth”. These words depicted importance of money and greed in the stories.