SAFETYLIT WEEKLY UPDATE

We compile citations and summaries of about 400 new articles every week.
RSS Feed

HELP: Tutorials | FAQ
CONTACT US: Contact info

Search Results

Journal Article

Citation

Li RD, Ma HT, Wang ZY, Guo Q, Liu JG. J. Saf. Sci. Resil. 2020; 1(1): 36-43.

Copyright

(Copyright © 2020, KeAi Communications, Publisher Elsevier Publishing)

DOI

10.1016/j.jnlssr.2020.06.005

PMID

unavailable

Abstract

Entity perception of ambiguous user comments is a critical problem of target identification for huge amount of public opinions. In this paper, a Two-Step-Matching method is proposed to identify the precise target entity from multiple entities mentioned. Firstly, potential entities are extracted by BiLSTM-CRF model and characteristic words by TF-IDF model from public comments. Secondly, the first matching is implemented between potential entities and an official business directory by Jaro-Winkler distance algorithm. Then, in order to find the precise one, an industry-characteristic dictionary is developed into the second matching process. The precise entity is identified according to the count of characteristic words matching to industry-characteristic dictionary. In addition, associated rate (global indicator) and accuracy rate (sample indicator) are defined for evaluation of matching accuracy. The results for three data sets of public opinions about major public health events show that the highest associated rate and accuracy rate arrive at 0.93 and 0.95, averagely enhanced by 32% and 30% above the case of using the first matching process alone. This framework provides the method to find the true target entity of really wanted expression from public opinions.


Language: en

Keywords

BiLSTM-CRF model; Entity perception; Jaro–Winkler distance algorithm; Public opinions; User comments

NEW SEARCH


All SafetyLit records are available for automatic download to Zotero & Mendeley
Print