The Constitution of a Fine-Grained Opinion Annotated Corpus on Weibo

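Source: The 15th China National Conference on Computational Linguistics (CCL2016) and the 4th International Symposium on Natural Language Processing Based on Naturally Annotated Big Data (NLP-NABD | Cited by: 0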

【Authors】: Liao Jian, Li Yang, Wang Suge

【Affiliation】: School of Computer & Information Technology, Shanxi University, Taiyuan, 030006, China

【Publication date】: 2016, Issue 6

【Keywords】: weibo corpus; fine-grained opinion annotation; implicit opinion annotation; evaluat

Excerpt from the paper

Sentiment analysis on social media such as Weibo is one of the most active research problems in NLP, and a comprehensive, systematic fine-grained annotated corpus plays a significant role in it. In this paper, considering the characteristics of Weibo, we focus on the constitution of a fine-grained, hierarchical opinion-annotated corpus and design a set of labelling specifications. We manually annotate the opinion sentences, some of which contain hidden opinions that can be useful for implicit sentiment analysis. A fine-grained aspect extraction, namely of opinion triples, is then carried out for aspect-level sentiment research. Moreover, we establish an evaluation method for the fine-grained aspect extraction task, which has been applied in evaluations for years. The corpus was used in the COAE2015 task, and it will be a useful resource for related research on social media sentiment analysis.

Other literature

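Chinese-English Cross-Lingual Word Embedding Based on Matrix Factorization of Pointwise Association Measures

This paper studies word embedding methods based on matrix factorization, proposes a unified descriptive model, and applies it to Chinese-English cross-lingual word embedding. Using bilingual aligned corpora as the knowledge source, it proposes a method for computing cross-lingual associated words and two ways of computing pointwise association measures: cross-lingual co-occurrence counts and cross-lingual pointwise mutual information. An objective function is designed for each to learn Chinese-English cross-lingual word embeddings. Experiments varying the objective function, corpus data, and vector dimensionality show that, in Chinese-English cross-lingual document classification, using the former as the pointwise association measure achieves an accuracy of up to 87.04%; in Chinese-English cross-lingual word similarity computation, the latter, used as the pointwise association meas

Conference

cross-lingual word embedding; pointwise association measure; matrix factorization; Chinese; English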

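Dependency Edge Transfer Translation Rule Generator

Statistical machine translation models, especially syntax-based ones, always face a trade-off between translation units that retain enough translation information and units that generalize well when translating new sentences. Neural networks have been successfully applied to reordering and language generation in statistical machine translation models. This paper proposes a novel neural-network-based syntactic translation rule generator, the Dependency Edge Transfer Translation Rule Generator (DETG), which takes the source side of a transfer translation rule together with the source-side context as input and produces the target side of the dependency edge transfer translation rule as output. It not only preserves dependency edges, which are a kind of

Conference

statistical machine translation; dependency edge transfer; translation rule generator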

Improving Chinese Semantic Role Labeling with English Proposition Bank

Most research on SRL focuses on English. It is still a challenge to improve the SRL performance of other languages. In this paper, we introduce a two-pass approach to do Chinese SRL with a Recurrent Neura

Conference

Chinese semantic role labeling; two-pass approach; Recurrent Neural Network; Englis

Improved Graph-based Dependency Parsing via Hierarchical LSTM Networks

In this paper, we propose a neural graph-based dependency parsing model which utilizes hierarchical LSTM networks on character level and word level to learn word representations, allowing our model to a

Conference

Graph-based dependency parsing; Hierarchical LSTM

Error Analysis of English-Chinese Machine Translation

In order to explore a practical way of improving machine translation (MT) quality, the error types and distribution of MT results have to be analyzed first. This paper analyzed English-Chinese MT errors f

Conference

Machine Translation; Error Analysis; NT clauses; SV clauses; Non-SV clauses

Using Collaborative Training Method to build Vietnamese Dependency Treebank

To address the difficulty of annotating Vietnamese dependency trees, this paper proposes a method that combines the MST algorithm with an improved Nivre algorithm to build a Vietnamese dependency treebank. The method too

Conference

Dependency Treebank; Vietnamese; Collaborative Training; Dependency Parsing

Coping with Problems of Unicoded Traditional Mongolian

Traditional Mongolian Unicode Encoding has serious problems, as several pairs of vowels with the same glyphs but different pronunciations are coded differently. We expose the severity of the problem by

Conference

Traditional Mongolian Script; Homographs; Input Method; Normalization

Semi-supervised Learning for Mongolian Morphological Segmentation

Unlike previous Mongolian morphological segmentation methods based on large labeled training data or complicated rules formulated by linguists, we explore a novel semi-supervised method for a practical

Conference

Semi-supervised learning; Morphological segmentation; Statistical machine translat

Recognizing Biomedical Named Entities Based on the Sentence Vector/Twin Word Embeddings Conditioned

As a fundamental step in biomedical information extraction tasks, biomedical named entity recognition remains challenging. In recent years, neural networks have been applied to the entity recognition t

Conference

LSTM; twin word embeddings; sentence vector; Viterbi algorithm

Combining Event-level and Cross-event Semantic Information for Event-Oriented Relation Classificatio

Previous research on event relation classification primarily relies on lexical and syntactic features. In this paper, we use a Shallow Convolutional Neural Network (SCNN) to extract event-level and cross-

Conference

Event Relation Classification; Semantic Information; Frame Embedding; SCNN
