Notice the denominator is simply the overall number of terms in document d (counting Each individual event of a similar phrase individually). There are actually numerous other solutions to determine phrase frequency:[5]: 128 An idf is frequent for each corpus, and accounts for your ratio of documents which include the phrase "this". Within this