You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
I ran the train mode on a corpus of size about 1G. I tried twice, each with 500 topics and 500 iterations. But I got two quite different results. I means the 2 files "lad.topToWor.txt" from 2 train results are quite different. I compared the words on each topic( ignored weight) . Only 250 topics on 2 files are similar ( more than 10 words are matched, which I can say that 2 topics in 2 files are similar )
This means that I will get quite different results from a random initialization. Is there a way that I can get a stable result?
increase the number of topics? or increase the iterations? I tried 1000 iterations but no big change.
Thanks!
Yanbo
The text was updated successfully, but these errors were encountered:
Hi
increase the number of topics? or increase the iterations? I tried 1000 iterations but no big change.
Yanbo
The text was updated successfully, but these errors were encountered: