{"id":210,"date":"2011-11-22T00:35:32","date_gmt":"2011-11-22T00:35:32","guid":{"rendered":"http:\/\/luisteixeira.org\/Myblog\/?page_id=210"},"modified":"2013-05-06T22:49:56","modified_gmt":"2013-05-06T22:49:56","slug":"prototipos","status":"publish","type":"page","link":"http:\/\/luisteixeira.org\/Myblog\/en\/prototipos\/","title":{"rendered":"Prototypes"},"content":{"rendered":"<!-- Chitika - WordPress Plugin 2.2--><div class='chitika-adspace above'>\n<script type='text\/javascript'>\n  ( function() {\n    if (window.CHITIKA === undefined) {\n      window.CHITIKA = { 'units' : [] };\n    };\n    var unit = {\n      'publisher'       : 'themaskedwolf',\n      'width'           : 550,\n      'height'          : 250,\n      'sid'             : \"wordpress-plugin above\",\n      'color_site_link' : '0000CC',\n      'color_title'     : '0000CC',\n      'color_text'      : '000000',\n      'color_bg'        : 'ffffff',\n      'font_title'      : 'Arial',\n      'font_text'       : 'Arial',\n      'impsrc'          : 'wordpress',\n      'calltype'        : 'async[2]'\n    };\n    var placement_id = window.CHITIKA.units.length;\n    window.CHITIKA.units.push(unit);\n    var x = \"<di\" + \"v id='chitikaAdBlock-\"+placement_id+\"'><\/di\"+\"v>\";\n    document.write(x);\n}());\n<\/script>\n<script type=\"text\/javascript\" src=\"\/\/cdn.chitika.net\/getads.js\" async><\/script>\n<\/div>\n<p><\/p>\n<h1>Prototypes<\/h1>\n<h2>TOPIXTRACT<\/h2>\n<p><a href=\"http:\/\/luisteixeira.org\/Myblog\/en\/prototipos\/topixtract\/\">TOPIXTRACT <\/a> is a language independent keyterm extractor from documents developed by Me in the framework of my MSC Thesis.<br \/>\nFor this purpose, it takes either words, or multi-words, or word prefixes (with fixed length 4 or 5 characters) as features to represent documents.Then uses 24 measures to identify feature importance for each document discimination.<br \/>\n<br\/ ><br \/>\nResults obtained may be evaluated by independent evaluators and their agreement is meaured usig Kappa statistics. Tf-idf and Chi-square based metrics have shown a higher precision.Word prefixes were used for dealing with highly inflected languages, and topic prefixes were just used as an aid for promoting words and multi-words as possible document topics.<br \/>\n<br\/ ><\/p>\n<p>More information can be obtained in the paper:<a href=\"http:\/\/www.springerlink.com\/content\/x52mv156l6t19732\">Lu\u00eds Teixeira, Gabriel Lopes, and Rita A. Ribeiro, \u201cAutomatic Extraction of Document Topics,\u201d in DoCEIS&#8217;11 &#8211; 2nd Edition of the Doctoral Conference on Computing, Electrical and Industrial Systems, Costa da Caparica, Portugal, 2011, pp. 101\u2013108. <\/a><br \/>\n<br\/ ><\/p>\n<hr align=\"center\" width=\"50%\" \/>\n<h2>BrainMap<\/h2>\n<p><br\/ ><br \/>\nOverwhelming  amounts  of  information  in  corporations  can  make  search  and  browse for  a  specific  topic  or  information  a  very  hard  task.  Therefore,  it  is  of  paramount importance  to  develop  tools  to  ease  the  retrieval  of  specific  information  and  to support  the  exploration  by  users  on  corporate  intranets  (composed  of  several hundreds  of  gigabytes  of  documents).  Although  not  explicitly  identified,  many  of these  documents  are  related  among  themselves  (directly  or  implicitly).<br \/> <br \/>\nThis  project aims  to  enable  the  visual  representation  of  documents  found  to  be  related  among themselves, but also to explore\/mine those relations.<br \/>\nThe  representation, intuitive navigation and selection  of these concepts is our  major goal.  When  certain  relations  between  these  concepts  are  particularly  relevant  they<br \/>\nmay lead to a natural flow of information and consequent navigation between them.   <\/p>\n<p>In  the <a href=\"http:\/\/luisteixeira.org\/Myblog\/prototipos\/brainmap\/\" title=\"BrainMap\">prototype page<\/a>  the studies  for  developing  a  navigation<br \/>\nsupport  system  to  explore  graphs  applied  to  document  correlations,  using  concepts from  the  weighted    complex  network  field,  and  using  their  unstructured  textual content are presented.   <\/p>\n<p>&nbsp;<\/p>\n<p style=\"text-align: center;\"><strong>For any aditional information, please don&#8217;t hesitate in contacting me<\/strong><\/p>\n<p style=\"text-align: center;\">lst<a href=\"http:\/\/www.google.com\/recaptcha\/mailhide\/d?k=019E04HH5yGpgFS8ByKX0WFA==&amp;c=-b_7XlVHfRUKwN8l_JOrMXuHDI-oiT_w8MUgSm4qe2g=\" onclick=\"window.open('http:\/\/www.google.com\/recaptcha\/mailhide\/d?k\\075019E04HH5yGpgFS8ByKX0WFA\\75\\75\\46c\\75-b_7XlVHfRUKwN8l_JOrMXuHDI-oiT_w8MUgSm4qe2g\\075', '', 'toolbar=0,scrollbars=0,location=0,statusbar=0,menubar=0,resizable=0,width=500,height=300'); return false;\" title=\"Reveal this e-mail address\">&#8230;<\/a>teixeira.org<\/p>\n<p><\/p>\n\n<!-- Facebook Like Button v1.9.6 BEGIN [http:\/\/blog.bottomlessinc.com] -->\n<iframe src=\"http:\/\/www.facebook.com\/plugins\/like.php?href=http%3A%2F%2Fluisteixeira.org%2FMyblog%2Fen%2Fprototipos%2F&amp;layout=button_count&amp;show_faces=true&amp;width=450&amp;action=like&amp;colorscheme=light\" scrolling=\"no\" frameborder=\"0\" allowTransparency=\"true\" style=\"border:none; overflow:hidden; width:450px; height: 60px; align: left; margin: 2px 0px 2px 0px\"><\/iframe>\n<!-- Facebook Like Button END -->","protected":false},"excerpt":{"rendered":"<p>Prototypes TOPIXTRACT TOPIXTRACT is a language independent keyterm extractor from documents developed by Me in the framework of my MSC Thesis. For this purpose, it takes either words, or multi-words, or word prefixes (with fixed length 4 or 5 characters) <a class=\"more-link\" href=\"http:\/\/luisteixeira.org\/Myblog\/en\/prototipos\/\">Continue reading <span class=\"screen-reader-text\">  Prototypes<\/span><span class=\"meta-nav\">&rarr;<\/span><\/a><\/p>\n","protected":false},"author":1,"featured_media":0,"parent":0,"menu_order":6,"comment_status":"closed","ping_status":"open","template":"","meta":{"footnotes":""},"class_list":["post-210","page","type-page","status-publish","hentry"],"_links":{"self":[{"href":"http:\/\/luisteixeira.org\/Myblog\/en\/wp-json\/wp\/v2\/pages\/210","targetHints":{"allow":["GET"]}}],"collection":[{"href":"http:\/\/luisteixeira.org\/Myblog\/en\/wp-json\/wp\/v2\/pages"}],"about":[{"href":"http:\/\/luisteixeira.org\/Myblog\/en\/wp-json\/wp\/v2\/types\/page"}],"author":[{"embeddable":true,"href":"http:\/\/luisteixeira.org\/Myblog\/en\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"http:\/\/luisteixeira.org\/Myblog\/en\/wp-json\/wp\/v2\/comments?post=210"}],"version-history":[{"count":15,"href":"http:\/\/luisteixeira.org\/Myblog\/en\/wp-json\/wp\/v2\/pages\/210\/revisions"}],"predecessor-version":[{"id":215,"href":"http:\/\/luisteixeira.org\/Myblog\/en\/wp-json\/wp\/v2\/pages\/210\/revisions\/215"}],"wp:attachment":[{"href":"http:\/\/luisteixeira.org\/Myblog\/en\/wp-json\/wp\/v2\/media?parent=210"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}