An explanation of how Tf-Idf and Word2Vec work.
Remember the documents we started with:
- Document 1: Maria is going to buy apples.
- Document 2: Maria is going to get grapes.
- Document 3: Maria is going to work.
And their classes:
- Class 0: Fruit Buying Trip
- Class 1: Work Trip
It would be useful to have a method that marks words as ‘not important’ when converting them into numbers. Their score could be lowered, so that instead of 1 they receive a smaller number that reflects their lower importance.
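To make the idea concrete, here is a minimal sketch of one way to lower the score of unimportant words. It assumes a naive whitespace tokenizer, drops the trailing periods from the documents, and uses the weight log(N / df), where N is the number of documents and df is the number of documents containing the word. That formula is only one common variant of inverse document frequency, and the names `binary_vector`, `weighted_vector` and `idf` are illustrative, not taken from any library.

```python
import math

# The three example documents (trailing periods dropped for simple tokenizing).
documents = [
    "Maria is going to buy apples",
    "Maria is going to get grapes",
    "Maria is going to work",
]

# Naive tokenization: lowercase and split on whitespace.
tokenized = [doc.lower().split() for doc in documents]
vocabulary = sorted({word for doc in tokenized for word in doc})


def binary_vector(tokens):
    """Score every vocabulary word as 1 if present in the document, else 0."""
    present = set(tokens)
    return [1 if word in present else 0 for word in vocabulary]


def document_frequency(word):
    """Count how many documents contain the word."""
    return sum(1 for doc in tokenized if word in doc)


# Assumed weighting: idf = log(N / df). A word found in every document gets 0.
idf = {
    word: math.log(len(tokenized) / document_frequency(word))
    for word in vocabulary
}


def weighted_vector(tokens):
    """Replace the flat score of 1 with the word's idf weight."""
    present = set(tokens)
    return [idf[word] if word in present else 0.0 for word in vocabulary]


for word in vocabulary:
    print(f"{word:>7}: {idf[word]:.2f}")
```

Under these assumptions, the words shared by all three documents (maria, is, going, to) receive a weight of 0, while words that occur in only one document (apples, buy, get, grapes, work) receive log(3), roughly 1.10. The words that distinguish the two classes therefore stand out in the weighted vectors.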