美文网首页
word2vec 理解

word2vec 理解

作者: Codlife | 来源:发表于2017-08-06 22:21 被阅读0次

1:word2vec 可以用于时序数据的挖掘,比如商品的浏览分析,app 下载分析,通过word2vec 可以得到商品或 app 的 向量表示,从而用于推荐等,个性化展示
http://ginobefunny.com/post/learning_word2vec/

2:一些使用经验
There's no universal rules-of-thumb, as even what makes a set of word-vectors good for one purpose might not be best for other purposes. (For example, word-vecs that do best on the analogies-test may not be also do the best at a topical-classification task that works on some mean-of-word-vectors.)

That said:

  • be sure to use the latest gensim; earlier versions could be significantly slower on very-short text examples (like tweets)

  • larger window sizes seem to position words closer according to topical-domain/field-of-use/semantic similarity; shorter window sizes position words closer based on functional/syntactic similarity (serve same role in sentence)

  • as your dataset gets larger, sometimes very-small values of window and negative are just as good (or better) and faster than larger values

  • as your dataset gets larger, more-aggressive frequent-word downsampling (the 'sample' parameter becoming smaller but not zero) can offer both speed and quality benefits (by spending fewer training cycles on redundant well-represented words)

  • it's typical to use more than one iteration, but as your data gets larger (and if you're confident word/word-senses are randomly distributed from front to back) the benefits of extra iterations will lessen

  • Gordon

相关文章

  • 基于TensorFlow实现Skip-Gram模型

    理解 Word2Vec 之 Skip-Gram 模型 Word2Vec是从大量文本语料中以无监督的方式学习语义知识...

  • 20201024-词的向量转换之NNLM

    概述 谈到NLP不得不介绍word2vec,word2vec可以很宽泛的理解,就是词的向量化过程,而这一过程是经历...

  • word2vec 理解

    1:word2vec 可以用于时序数据的挖掘,比如商品的浏览分析,app 下载分析,通过word2vec 可以得到...

  • 理解Word2Vec

    独热编码其方法是使用N位状态寄存器来对N个状态进行编码,每个状态都有它独立的寄存器位,并且在任意时候,其中只有一位...

  • Word2Vec原理,彻底搞懂

    References: 图解Word2vec 经典再读之Word2Vec word2vec Parameter L...

  • word2vector简介

    word2vec官网:https://code.google.com/p/word2vec/ word2vec是g...

  • 入门自然语言处理必看,图解word2vec!

    图解word2vec精翻版,加入了自己的理解,和稍微有点出入,http://jalammar.github.io/...

  • word2vec

    word2vec word2vec 构建中文词向量

  • Word2Vector 算法

    谈谈谷歌word2vec的原理github上的资料(1) word2vec 中的数学原理详解 word2vec源码...

  • 通俗理解word2vec

    独热编码 独热编码即 One-Hot 编码,又称一位有效编码,其方法是使用N位状态寄存器来对N个状态进行编码,每个...

网友评论

      本文标题:word2vec 理解

      本文链接:https://www.haomeiwen.com/subject/koinlxtx.html