A clustering and word similarity based approach for identifying product feature words

Research output: Contribution to journal › Conference article

Abstract

Product designers need to capture feedback from customers in order to assess how a product performs and is perceived in the market. One publicly available source of customer feedback is the online reviews on e-commerce websites. Two main difficulties in dealing with the reviews are finding relevant words related to a product and grouping different words that represent the same product feature. To overcome these difficulties, both lexical and distributional approaches are utilized in the paper. Using distributional information, words are embedded into a real vector space using word2vec and then clustered. Using lexical information from WordNet, the head word for each cluster is identified by considering its similarity with the head words of other clusters. A comparison is made between X-means and iterative c-means clustering, with word similarity information added when breaking a cluster. In a case study of wearable technology products, starting from a large number of words, the approach is shown to identify relevant product feature words.
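The two stages named in the abstract (cluster word vectors, then pick a head word per cluster) can be illustrated with a minimal sketch. Everything in it is an assumption made for illustration, not the authors' implementation: the 2-D vectors in `EMBEDDINGS` are hand-written stand-ins for word2vec output, plain k-means with fixed seeds replaces the X-means and iterative c-means variants compared in the paper, and `head_word` simply picks the member nearest the cluster centroid rather than using WordNet similarity against other clusters' head words.

```python
import math

# Hypothetical 2-D vectors standing in for word2vec embeddings.
# Real embeddings would be learned from review text and have many more dimensions.
EMBEDDINGS = {
    "battery": (0.9, 0.1),
    "charge": (0.8, 0.2),
    "power": (0.85, 0.15),
    "screen": (0.1, 0.9),
    "display": (0.2, 0.8),
    "resolution": (0.15, 0.85),
}


def mean_vector(words, vectors):
    """Component-wise mean of the vectors for the given words."""
    dims = len(vectors[words[0]])
    return tuple(sum(vectors[w][d] for w in words) / len(words) for d in range(dims))


def kmeans(vectors, seeds, iterations=10):
    """Plain k-means (a stand-in for X-means / iterative c-means).

    `seeds` names the words whose vectors are used as initial centroids,
    which keeps the sketch deterministic.
    """
    centroids = [vectors[s] for s in seeds]
    clusters = []
    for _ in range(iterations):
        clusters = [[] for _ in centroids]
        # Assign every word to its nearest centroid (Euclidean distance).
        for word, vec in vectors.items():
            nearest = min(range(len(centroids)),
                          key=lambda i: math.dist(vec, centroids[i]))
            clusters[nearest].append(word)
        # Recompute centroids; an empty cluster keeps its old centroid.
        centroids = [mean_vector(members, vectors) if members else centroids[i]
                     for i, members in enumerate(clusters)]
    return clusters


def head_word(members, vectors):
    """Simplified head-word choice: the member closest to the cluster centroid.

    The paper instead uses WordNet-based similarity; this is only a placeholder.
    """
    centroid = mean_vector(members, vectors)
    return min(members, key=lambda w: math.dist(vectors[w], centroid))


if __name__ == "__main__":
    for cluster in kmeans(EMBEDDINGS, seeds=["battery", "screen"]):
        print(head_word(cluster, EMBEDDINGS), cluster)
```

With real data, the toy dictionary would be replaced by vectors from a trained embedding model, and the head-word step by a lexical similarity measure, which is where the paper's WordNet information would enter.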

Original language: English (US)
Pages (from-to): 71-80
Number of pages: 10
Journal: Proceedings of the International Conference on Engineering Design, ICED
Volume: 6
Issue number: DS87-6
State: Published - Jan 1 2017
Event: 21st International Conference on Engineering Design, ICED 2017 - Vancouver, Canada
Duration: Aug 21 2017 to Aug 25 2017

Fingerprint

Clustering
Feedback
Vector spaces
Websites
Customers
WordNet
Electronic Commerce
Grouping
Vector space
Similarity
Wearable technology
Review

Keywords

  • Case study
  • Design informatics
  • Market implications

ASJC Scopus subject areas

  • Engineering (miscellaneous)
  • Industrial and Manufacturing Engineering
  • Modeling and Simulation

Cite this

A clustering and word similarity based approach for identifying product feature words. / Suryadi, Dedy; Kim, Harrison Hyung Min.

In: Proceedings of the International Conference on Engineering Design, ICED, Vol. 6, No. DS87-6, 01.01.2017, p. 71-80.

@article{3773c6a8ab234452a59ca93054c18290,
title = "A clustering and word similarity based approach for identifying product feature words",
abstract = "Product designers need to capture feedback from customers in order to assess how the product performs and is perceived in the market. One such example of publicly available source of customer's feedback is the online reviews in an e-commerce website. Two main difficulties in dealing with the reviews are finding relevant words related to a product and grouping different words that represent the same product feature. To overcome these difficulties, both lexical and distributional approaches are utilized in the paper. Using distributional information, words are embedded into real vector space using word2vec and then clustered. Using lexical information from WordNet, the head word for each cluster is identified by considering the similarity with the head words of other clusters. A comparison is made between using X-means and iterative c-means clustering with added word similarity information when breaking a cluster. In the case study of wearable technology products, starting from a large number of words, the approach is shown to identify relevant product feature words.",
keywords = "Case study, Design informatics, Market implications",
author = "Dedy Suryadi and Kim, {Harrison Hyung Min}",
year = "2017",
month = "1",
day = "1",
language = "English (US)",
volume = "6",
pages = "71--80",
journal = "Proceedings of the International Conference on Engineering Design, ICED",
issn = "2220-4334",
number = "DS87-6",

}

TY - JOUR

T1 - A clustering and word similarity based approach for identifying product feature words

AU - Suryadi, Dedy

AU - Kim, Harrison Hyung Min

PY - 2017/1/1

Y1 - 2017/1/1

N2 - Product designers need to capture feedback from customers in order to assess how the product performs and is perceived in the market. One such example of publicly available source of customer's feedback is the online reviews in an e-commerce website. Two main difficulties in dealing with the reviews are finding relevant words related to a product and grouping different words that represent the same product feature. To overcome these difficulties, both lexical and distributional approaches are utilized in the paper. Using distributional information, words are embedded into real vector space using word2vec and then clustered. Using lexical information from WordNet, the head word for each cluster is identified by considering the similarity with the head words of other clusters. A comparison is made between using X-means and iterative c-means clustering with added word similarity information when breaking a cluster. In the case study of wearable technology products, starting from a large number of words, the approach is shown to identify relevant product feature words.

AB - Product designers need to capture feedback from customers in order to assess how the product performs and is perceived in the market. One such example of publicly available source of customer's feedback is the online reviews in an e-commerce website. Two main difficulties in dealing with the reviews are finding relevant words related to a product and grouping different words that represent the same product feature. To overcome these difficulties, both lexical and distributional approaches are utilized in the paper. Using distributional information, words are embedded into real vector space using word2vec and then clustered. Using lexical information from WordNet, the head word for each cluster is identified by considering the similarity with the head words of other clusters. A comparison is made between using X-means and iterative c-means clustering with added word similarity information when breaking a cluster. In the case study of wearable technology products, starting from a large number of words, the approach is shown to identify relevant product feature words.

KW - Case study

KW - Design informatics

KW - Market implications

UR - http://www.scopus.com/inward/record.url?scp=85029747063&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=85029747063&partnerID=8YFLogxK

M3 - Conference article

AN - SCOPUS:85029747063

VL - 6

SP - 71

EP - 80

JO - Proceedings of the International Conference on Engineering Design, ICED

JF - Proceedings of the International Conference on Engineering Design, ICED

SN - 2220-4334

IS - DS87-6

ER -