《Joint Extraction of Entities an

Author: best___me | Published: 2017-10-31 14:31

Source: ACL 2017

Joint extraction of entities and relations: detect entities and the relations between them simultaneously from unstructured text.

Unlike Open IE, where relation words are extracted from the given sentence, in this task relation words are drawn from a predefined relation set and may not appear in the given sentence.

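Previous approaches work in a pipelined manner: first extract the entities, then identify the relations between them. This is simple and flexible, but it treats each task as independent and ignores the dependency between the two. Errors in entity recognition affect relation classification and propagate through it.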

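Joint learning frameworks can effectively integrate entity and relation information. Most existing joint methods are feature-based structured systems: they need complicated features and depend on other NLP toolkits, which can also lead to error propagation.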

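Miwa and Bansal (2016) proposed an end-to-end model for entity and relation extraction. Although entities and relations share parameters in that model, it still extracts entities and relations separately.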

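This paper proposes a tagging scheme combined with an end-to-end model, which converts joint extraction into a tagging problem. It is a supervised learning method. Because manual annotation is labor-intensive and error-prone, the paper uses a public dataset.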

Main contributions of the paper: (1) a novel tagging scheme; (2) tagging-based methods perform better than most of the existing pipelined and joint learning methods; (3) an end-to-end model with a biased loss function designed to suit the novel tags, which enhances the association between related entities.

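Method:

The paper proposes an end-to-end model built on a tagging scheme. The model has a biased objective function and jointly extracts entities and the relations between them.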

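If a sentence contains multiple triplets of the same relation type, entities are paired by the nearest principle: each entity is combined with the closest entity that carries the same relation type.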

BIES (Begin, Inside, End, Single): the position of a word within an entity.
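To make the BIES position labels concrete, here is a minimal sketch that labels the tokens of a sentence given entity spans. The function names and the `(start, end)` span format (end exclusive) are assumptions made for this illustration, not something defined in the paper.

```python
def bies_positions(length):
    """Return the BIES position labels for an entity covering `length` tokens."""
    if length < 1:
        raise ValueError("an entity must cover at least one token")
    if length == 1:
        return ["S"]  # Single: the entity is one token long
    return ["B"] + ["I"] * (length - 2) + ["E"]  # Begin, Inside..., End


def tag_sentence(tokens, entity_spans):
    """Label every token with B/I/E/S, or "O" when it is outside any entity.

    entity_spans is a list of (start, end) token indices, end exclusive
    (a hypothetical input format chosen for this sketch).
    """
    tags = ["O"] * len(tokens)
    for start, end in entity_spans:
        tags[start:end] = bies_positions(end - start)
    return tags


tokens = "The United States President Trump will visit the Apple Inc".split()
print(tag_sentence(tokens, [(1, 3), (4, 5), (8, 10)]))
# ['O', 'B', 'E', 'O', 'S', 'O', 'O', 'O', 'B', 'E']
```

Two-token entities get `B` followed by `E`, and `I` only appears for entities of three or more tokens.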

An extracted result is represented by a triplet: (Entity1, RelationType, Entity2).
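The sketch below shows one way a tag sequence could be turned back into triplets. It assumes each non-`O` tag is a string of the form `position-relation-role` (for example `B-CP-1`), with a short relation code that contains no hyphen and a role of `1` or `2` marking Entity1 or Entity2. The greedy pairing and the distance measured between entity start indices are simplifications chosen for this illustration, and they may differ from the authors' implementation of the nearest principle.

```python
def collect_entities(tokens, tags):
    """Group tagged tokens into (start_index, text, relation, role) tuples."""
    entities, current = [], None
    for i, (token, tag) in enumerate(zip(tokens, tags)):
        if tag == "O":
            current = None
            continue
        position, relation, role = tag.split("-")
        if position in ("B", "S"):
            current = [i, [token], relation, role]
        elif current is not None and current[2:] == [relation, role]:
            current[1].append(token)
        else:
            current = None  # malformed sequence: I/E without a matching B
            continue
        if position in ("E", "S"):
            entities.append((current[0], " ".join(current[1]), relation, role))
            current = None
    return entities


def decode_triplets(tokens, tags):
    """Pair each role-1 entity with the nearest unused role-2 entity of the same relation."""
    entities = collect_entities(tokens, tags)
    firsts = [e for e in entities if e[3] == "1"]
    seconds = [e for e in entities if e[3] == "2"]
    triplets, used = [], set()
    for start, text, relation, _ in firsts:
        candidates = [s for s in seconds if s[2] == relation and s[0] not in used]
        if not candidates:
            continue  # no partner found, so no triplet is emitted
        nearest = min(candidates, key=lambda s: abs(s[0] - start))
        used.add(nearest[0])
        triplets.append((text, relation, nearest[1]))
    return triplets


tokens = ("The United States President Trump will visit "
          "the Apple Inc founded by Steven Paul Jobs").split()
tags = ["O", "B-CP-1", "E-CP-1", "O", "S-CP-2", "O", "O", "O",
        "B-CF-1", "E-CF-1", "O", "O", "B-CF-2", "I-CF-2", "E-CF-2"]
print(decode_triplets(tokens, tags))
# [('United States', 'CP', 'Trump'), ('Apple Inc', 'CF', 'Steven Paul Jobs')]
```

Here `CP` and `CF` stand for Country-President and Company-Founder. Because the relation type is part of every tag, entities are only ever paired within the same relation.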

End-to-end Model:

Model

The model consists of a Bi-LSTM layer that encodes the input sentence and an LSTM decoding layer trained with a biased loss. The biased loss enhances the association between entity tags.
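One plausible reading of the biased loss is a log-likelihood in which tokens whose gold tag is not `O` are weighted by a bias factor, so that entity and relation tags influence training more than the `O` tag. The sketch below illustrates that idea only. The function name, the input format, and the default value of `alpha` are assumptions for this example, not the authors' code.

```python
import math


def biased_log_likelihood(tag_probs, gold_tags, alpha=10.0):
    """Sum of gold-tag log-probabilities, with non-"O" tags weighted by alpha.

    tag_probs: one dict per token mapping tag -> predicted probability
               (hypothetical input format).
    gold_tags: the gold tag of each token.
    alpha:     bias weight; values above 1 make relational tags count more.
    """
    total = 0.0
    for dist, gold in zip(tag_probs, gold_tags):
        log_p = math.log(dist[gold])
        total += log_p if gold == "O" else alpha * log_p
    return total


probs = [{"O": 0.9, "S-CP-1": 0.1}, {"O": 0.4, "S-CP-1": 0.6}]
gold = ["O", "S-CP-1"]
print(biased_log_likelihood(probs, gold, alpha=1.0))   # plain log-likelihood
print(biased_log_likelihood(probs, gold, alpha=10.0))  # entity tag weighted 10x
```

Training would maximize this quantity (or minimize its negative). With `alpha = 1` it reduces to the ordinary log-likelihood.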

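The Bi-LSTM encoding layer: consists of a forward LSTM layer, a backward LSTM layer, and a concatenation layer. The word embedding layer converts each word from its 1-hot representation into an embedding vector, so a sequence of words can be represented as W = {w1, ..., wt, wt+1, ..., wn}, where wt is the d-dimensional word vector of the t-th word in a sentence of length n.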

After the word embedding layer there are two parallel LSTM layers: a forward LSTM and a backward LSTM.
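The encoding layer described above (embedding lookup, a forward and a backward LSTM run in parallel, then concatenation) can be sketched in plain Python as follows. This is a toy illustration with random, untrained weights and a standard LSTM cell. The gate formulation used in the paper may differ, and all names and dimensions here are assumptions.

```python
import math
import random


def sigmoid(x):
    return 1.0 / (1.0 + math.exp(-x))


def make_lstm_params(input_dim, hidden_dim, rng):
    """Random (weights, bias) for the input, forget, output and candidate gates."""
    def matrix(rows, cols):
        return [[rng.uniform(-0.1, 0.1) for _ in range(cols)] for _ in range(rows)]
    return {gate: (matrix(hidden_dim, input_dim + hidden_dim), [0.0] * hidden_dim)
            for gate in "ifog"}


def affine(weights, bias, vector):
    return [sum(w * v for w, v in zip(row, vector)) + b for row, b in zip(weights, bias)]


def lstm_step(params, x, h_prev, c_prev):
    z = x + h_prev  # list concatenation: [x; h_prev]
    i = [sigmoid(v) for v in affine(*params["i"], z)]
    f = [sigmoid(v) for v in affine(*params["f"], z)]
    o = [sigmoid(v) for v in affine(*params["o"], z)]
    g = [math.tanh(v) for v in affine(*params["g"], z)]
    c = [fv * cv + iv * gv for fv, cv, iv, gv in zip(f, c_prev, i, g)]
    h = [ov * math.tanh(cv) for ov, cv in zip(o, c)]
    return h, c


def run_lstm(params, inputs, hidden_dim):
    """Run one LSTM over the inputs in the given order and return every hidden state."""
    h, c = [0.0] * hidden_dim, [0.0] * hidden_dim
    outputs = []
    for x in inputs:
        h, c = lstm_step(params, x, h, c)
        outputs.append(h)
    return outputs


def bilstm_encode(word_ids, embeddings, fwd, bwd, hidden_dim):
    # Row lookup is equivalent to multiplying a 1-hot vector by the embedding matrix.
    vectors = [embeddings[i] for i in word_ids]
    forward = run_lstm(fwd, vectors, hidden_dim)
    # The backward LSTM reads the sentence right to left; re-reverse to align by position.
    backward = run_lstm(bwd, vectors[::-1], hidden_dim)[::-1]
    return [hf + hb for hf, hb in zip(forward, backward)]  # concatenation layer


rng = random.Random(0)
vocab_size, d, hidden_dim = 10, 4, 3
embeddings = [[rng.uniform(-1, 1) for _ in range(d)] for _ in range(vocab_size)]
fwd = make_lstm_params(d, hidden_dim, rng)
bwd = make_lstm_params(d, hidden_dim, rng)
encoded = bilstm_encode([1, 5, 2, 7], embeddings, fwd, bwd, hidden_dim)
print(len(encoded), len(encoded[0]))  # 4 6
```

Each of the n words ends up with a vector of size 2 × hidden_dim, whose first half summarizes the left context and whose second half summarizes the right context.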

BiLSTM: