embedding_lookup

Author: 闫品品 | Published: 2019-07-12 10:51

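Parameters:
input_ids: a tensor of shape [batch_size, seq_length] containing word ids
vocab_size: size of the embedding vocabulary
embedding_size: width of the word embeddings
initializer_range: range used to initialize the embeddings
word_embedding_name: name of the embedding table
use_one_hot_embeddings: if True, use the one-hot embedding method; if False, use tf.nn.embedding_lookup(). As the docstring below notes, one-hot works better on TPUs, while the non-one-hot path works better on CPU/GPU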
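
The two settings of use_one_hot_embeddings should produce the same values and differ only in how the rows are fetched. The following is a minimal sketch in NumPy rather than TensorFlow, with toy sizes and random values chosen purely for illustration (none of them come from the original post):

```python
import numpy as np

# Hypothetical toy sizes, chosen only for illustration.
vocab_size, embedding_size = 5, 3
rng = np.random.default_rng(0)
embedding_table = rng.normal(scale=0.02, size=(vocab_size, embedding_size))

input_ids = np.array([[0, 2, 4], [1, 1, 3]])  # [batch_size=2, seq_length=3]
flat_ids = input_ids.reshape(-1)              # [batch_size * seq_length]

# One-hot path: build a [batch_size * seq_length, vocab_size] matrix
# and multiply it by the table.
one_hot = np.eye(vocab_size)[flat_ids]
one_hot_output = one_hot @ embedding_table

# Gather path: index the rows of the table directly.
gather_output = embedding_table[flat_ids]

same_result = np.allclose(one_hot_output, gather_output)
print(same_result, one_hot_output.shape)
```

The one-hot path trades a sparse row lookup for a dense matrix multiplication, which is the trade-off the parameter description refers to.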

Returns:
1. A tensor of shape [batch_size, seq_length, embedding_size]
2. The entire embedding_table, of shape [vocab_size, embedding_size]

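import tensorflow as tf  # TensorFlow 1.x API (tf.get_variable)

# Note: create_initializer() and get_shape_list() are helper functions that
# are not defined in this post.
def embedding_lookup(input_ids,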
                     vocab_size,
                     embedding_size=128,
                     initializer_range=0.02,
                     word_embedding_name="word_embeddings",
                     use_one_hot_embeddings=False):
    """Looks up word embeddings for an id tensor.

    Args:
      input_ids: int32 Tensor of shape [batch_size, seq_length] containing word
        ids.
      vocab_size: int. Size of the embedding vocabulary.
      embedding_size: int. Width of the word embeddings.
      initializer_range: float. Embedding initialization range.
      word_embedding_name: string. Name of the embedding table.
      use_one_hot_embeddings: bool. If True, use one-hot method for word
        embeddings. If False, use `tf.nn.embedding_lookup()`. One hot is better
        for TPUs.

    Returns:
      A tuple (output, embedding_table): a float Tensor of shape
      [batch_size, seq_length, embedding_size], and the full embedding table
      of shape [vocab_size, embedding_size].
    """
    # This function assumes that the input is of shape [batch_size, seq_length,
    # num_inputs].
    #
    # If the input is a 2D tensor of shape [batch_size, seq_length], we
    # reshape to [batch_size, seq_length, 1].
    if input_ids.shape.ndims == 2:
        input_ids = tf.expand_dims(input_ids, axis=-1)

    embedding_table = tf.get_variable(
        name=word_embedding_name,
        shape=[vocab_size, embedding_size],
        initializer=create_initializer(initializer_range))

    if use_one_hot_embeddings:
        flat_input_ids = tf.reshape(input_ids, [-1])
        one_hot_input_ids = tf.one_hot(flat_input_ids, depth=vocab_size)
        output = tf.matmul(one_hot_input_ids, embedding_table)
    else:
        output = tf.nn.embedding_lookup(embedding_table, input_ids)

    input_shape = get_shape_list(input_ids)

    output = tf.reshape(output,
                        input_shape[0:-1] + [input_shape[-1] * embedding_size])
    return (output, embedding_table)
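
The shape handling in the function above can be traced step by step. This is a rough sketch in NumPy that mirrors the expand_dims, lookup, and reshape steps; the sizes and the arange-filled table are assumptions made for illustration, not values from the original code:

```python
import numpy as np

# Hypothetical sizes, chosen only for illustration.
batch_size, seq_length, vocab_size, embedding_size = 2, 4, 10, 8
embedding_table = np.arange(
    vocab_size * embedding_size, dtype=np.float32
).reshape(vocab_size, embedding_size)
input_ids = np.arange(batch_size * seq_length).reshape(batch_size, seq_length)

# Step 1: 2D ids become [batch_size, seq_length, 1].
ids_3d = np.expand_dims(input_ids, axis=-1)

# Step 2: the lookup appends an embedding axis, giving
# [batch_size, seq_length, 1, embedding_size].
gathered = embedding_table[ids_3d]

# Step 3: merge the last two axes, as the final tf.reshape does, giving
# [batch_size, seq_length, 1 * embedding_size].
input_shape = list(ids_3d.shape)
output = gathered.reshape(input_shape[:-1] + [input_shape[-1] * embedding_size])

print(ids_3d.shape, gathered.shape, output.shape)
```

Each position in output holds the table row for the corresponding id, so output[0, 1] equals embedding_table[input_ids[0, 1]].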

Article link: https://www.haomeiwen.com/subject/nhfekctx.html
