The signal matrix for word2vec #3

XuhuiZhou · 2019-04-07T13:45:10Z

In your paper, you seem to use the original PMI matrix as the signal matrix of word2vec. However, in Omer's paper, their conclusion points to the signal matrix PMI-log(k), where the k is the num of negative sampling.

Could you tell me is there a reason to choose PMI instead of PMI-log(k)?

ziyin-dl · 2019-04-07T13:47:37Z

As we said in the paper, we used PPMIfor word2vec without neg sampling, and SPPMI for word2vec with negative sampling. This repo is not for the one in the paper. You probably want to refer to
https://github.com/ziyin-dl/word-embedding-dimensionality-selection

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

The signal matrix for word2vec #3

The signal matrix for word2vec #3

XuhuiZhou commented Apr 7, 2019

ziyin-dl commented Apr 7, 2019

The signal matrix for word2vec #3

The signal matrix for word2vec #3

Comments

XuhuiZhou commented Apr 7, 2019

ziyin-dl commented Apr 7, 2019