Questions about the c-tf-idf

Hi Maarten, I have some questions about the c-tf-idf implementation

[Image: c-TF-IDF formula]

Is there a reason why you’re using “A” (the average number of words per class)? I saw your earlier Medium post about c-tf-idf, and I don’t see a clear connection between the two formulas. Judging by the code, “A” looks more like a regularization value. Is that right? Also, just to be sure:

  • does tf_x,c refer to the count of word x divided by the total words across all documents in c?
  • does f_x refer to the count of word x divided by the total number of documents?

Thanks, and I appreciate all the work you put in this library 😃

Issue Analytics

  • State: closed
  • Created: 2 years ago
  • Comments: 9 (4 by maintainers)

Top GitHub Comments

MaartenGr commented, Feb 21, 2022 (1 reaction)

> Just to be sure: I assume that the given image in the code is still a leftover from the old implementation, am I right?

The image that you can find here is currently the most up-to-date one and should reflect the procedure accurately.

> I really appreciate your work and the excellent documentation.

Thank you for your kind words! If you have any other questions, please let me know.

lkluo commented, Aug 5, 2022 (0 reactions)

Thanks @MaartenGr, it is very clear.
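
For readers following the thread, here is a minimal sketch of the quantities the question asks about. It assumes the formula given in the BERTopic documentation, W_x,c = tf_x,c * log(1 + A / f_x), where tf_x,c is the L1-normalized frequency of word x within class c, f_x is the frequency of word x across all classes, and A is the average number of words per class. The function name `c_tf_idf` and the toy `counts` matrix are hypothetical and chosen only for illustration. This is not the library's own code and may differ from it in details such as rounding.

```python
import numpy as np


def c_tf_idf(counts):
    """Class-based TF-IDF weights for a (n_classes, n_words) count matrix.

    counts[c, x] is the raw count of word x in class c, where all
    documents of a class are treated as one concatenated document.
    Assumes every class is non-empty and every word occurs at least once.
    """
    counts = np.asarray(counts, dtype=float)
    words_per_class = counts.sum(axis=1, keepdims=True)  # total words in each class
    tf = counts / words_per_class          # tf_x,c: share of class c's words that are x
    f_x = counts.sum(axis=0)               # f_x: count of word x across all classes
    avg_words = words_per_class.mean()     # A: average number of words per class
    idf = np.log(1.0 + avg_words / f_x)    # the "+ 1" keeps the logarithm positive
    return tf * idf


# Toy example (made-up numbers): 2 classes, 3 words.
counts = np.array([[2, 1, 0],
                   [0, 1, 3]])
weights = c_tf_idf(counts)
print(weights)
```

Under this reading, tf_x,c is normalized by the number of words in the class, while f_x is a raw count over all classes rather than a count divided by the number of documents. Because of the `1 +` inside the logarithm, a word that is frequent everywhere still gets a small positive weight.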
