Q3-D3-LSA
dc.contributor.author | Borke, Lukas | |
dc.contributor.author | Härdle, Wolfgang Karl | |
dc.date.accessioned | 2017-10-02T09:22:17Z | |
dc.date.available | 2017-10-02T09:22:17Z | |
dc.date.issued | 2016-11-15 | |
dc.identifier.uri | http://edoc.hu-berlin.de/18452/19102 | |
dc.description.abstract | QuantNet 1 is an integrated web-based environment consisting of different types of statistics-related documents and program codes. Its goal is creating reproducibility and offering a platform for sharing validated knowledge native to the social web. To increase the information retrieval (IR) efficiency there is a need for incorporating semantic information. Three text mining models will be examined: vector space model (VSM), generalized VSM (GVSM) and latent semantic analysis (LSA). The LSA has been successfully used for IR purposes as a technique for capturing semantic relations between terms and inserting them into the similarity measure between documents. Our results show that different model configurations allow adapted similarity-based document clustering and knowledge discovery. In particular, different LSA configurations together with hierarchical clustering reveal good results under M3 evaluation. QuantNet and the corresponding Data-Driven Documents (D3) based visualization can be found and applied under http://quantlet.de. The driving technology behind it is Q3-D3-LSA, which is the combination of “GitHub API based QuantNet Mining infrastructure in R”, LSA and D3 implementation. | eng |
dc.language.iso | eng | |
dc.publisher | Humboldt-Universität zu Berlin | |
dc.rights.uri | http://rightsstatements.org/vocab/InC/1.0/ | |
dc.subject | QuantNet | eng |
dc.subject | D3 | eng |
dc.subject | GitHub API | eng |
dc.subject | text mining | eng |
dc.subject | document clustering | eng |
dc.subject | similarity | eng |
dc.subject | semantic web | eng |
dc.subject | generalized vector space model | eng |
dc.subject | LSA | eng |
dc.subject | visualization | eng |
dc.subject.ddc | 330 Wirtschaft | |
dc.title | Q3-D3-LSA | |
dc.type | workingPaper | |
dc.identifier.urn | urn:nbn:de:kobv:11-110-18452/19102-6 | |
dc.identifier.doi | http://dx.doi.org/10.18452/18425 | |
local.edoc.pages | 48 | |
local.edoc.type-name | Diskussionspapier | |
local.edoc.container-type | series | |
local.edoc.container-type-name | Schriftenreihe | |
dc.identifier.zdb | 2195055-6 | |
bua.series.name | Sonderforschungsbereich 649: Ökonomisches Risiko | |
bua.series.issuenumber | 2016,49 | |
bua.department | Wirtschaftswissenschaftliche Fakultät |