{"id":316,"date":"2007-05-18T08:53:54","date_gmt":"2007-05-18T13:53:54","guid":{"rendered":"http:\/\/blog.uvm.edu\/hag\/2007\/05\/18\/wmatrix-text-analysis-semantic-analysis\/"},"modified":"2007-05-18T08:53:54","modified_gmt":"2007-05-18T13:53:54","slug":"wmatrix-text-analysis-semantic-analysis","status":"publish","type":"post","link":"https:\/\/blog.uvm.edu\/hag\/2007\/05\/18\/wmatrix-text-analysis-semantic-analysis\/","title":{"rendered":"WMatrix: text analysis + semantic analysis"},"content":{"rendered":"<p><img decoding=\"async\" src=\"http:\/\/www.comp.lancs.ac.uk\/ucrel\/wmatrix\/wmatrix_vert.gif\" height=\"180\" align=\"left\"\/>Text Analysis programs that can do word frequency, KWIC, concordancing, etc. are fairly well-established (cf: <a href=\"http:\/\/www.textanalysis.info\/\">Harald Klein&#8217;s text analysis informational pages<\/a> or U of Alberta&#8217;s <a href=\"http:\/\/tapor.ualberta.ca\/Resources\/TASoftware\/\">TaPOR site<\/a>).<br \/>\nWMatrix is a web-based tool that does the standard analysis but, like more recent knowledge mining applications, &#8220;extends the keywords method to key grammatical categories and key semantic fields.&#8221; It also adds a log-likelihood tool to &#8220;perform a comparison of the frequency list for their corpus against another larger normative corpus such as the BNC sampler, or against another of their own texts.&#8221; And does all of this on tagged (HTML, SGML, XML) texts that you upload to their site. Only downside: after the free subscription runs out it costs \u00a3100\/yr to subscribe.<br \/>\nRelated Links<br \/>\n1) <a href=\"http:\/\/gate.ac.uk\/index.html\">GATE: General Architecture for Text Engineering<\/a>, University of Sheffield<br \/>\n2) Nasukawa, T. and T. Nagano, &#8220;<a href=\"http:\/\/researchweb.watson.ibm.com\/journal\/sj\/404\/nasukawa.html\">Text analysis and knowledge mining system<\/a>&#8221;<br \/>\n3) Overview of <a href=\"http:\/\/en.wikipedia.org\/wiki\/Natural_language_processing\">natural language processing at wikipedia<\/a><\/p>\n","protected":false},"excerpt":{"rendered":"<p>Text Analysis programs that can do word frequency, KWIC, concordancing, etc. are fairly well-established (cf: Harald Klein&#8217;s text analysis informational pages or U of Alberta&#8217;s TaPOR site). WMatrix is a web-based tool that does the standard analysis but, like more &hellip; <a href=\"https:\/\/blog.uvm.edu\/hag\/2007\/05\/18\/wmatrix-text-analysis-semantic-analysis\/\">Continue reading <span class=\"meta-nav\">&rarr;<\/span><\/a><\/p>\n","protected":false},"author":6,"featured_media":0,"comment_status":"open","ping_status":"closed","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[16784],"tags":[],"class_list":["post-316","post","type-post","status-publish","format-standard","hentry","category-digital-humanities"],"_links":{"self":[{"href":"https:\/\/blog.uvm.edu\/hag\/wp-json\/wp\/v2\/posts\/316","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/blog.uvm.edu\/hag\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/blog.uvm.edu\/hag\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/blog.uvm.edu\/hag\/wp-json\/wp\/v2\/users\/6"}],"replies":[{"embeddable":true,"href":"https:\/\/blog.uvm.edu\/hag\/wp-json\/wp\/v2\/comments?post=316"}],"version-history":[{"count":0,"href":"https:\/\/blog.uvm.edu\/hag\/wp-json\/wp\/v2\/posts\/316\/revisions"}],"wp:attachment":[{"href":"https:\/\/blog.uvm.edu\/hag\/wp-json\/wp\/v2\/media?parent=316"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/blog.uvm.edu\/hag\/wp-json\/wp\/v2\/categories?post=316"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/blog.uvm.edu\/hag\/wp-json\/wp\/v2\/tags?post=316"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}