{"id":662,"date":"2016-07-20T13:53:28","date_gmt":"2016-07-20T16:53:28","guid":{"rendered":"https:\/\/sbia.org.br\/lnlm\/?page_id=662"},"modified":"2016-07-20T13:53:28","modified_gmt":"2016-07-20T16:53:28","slug":"vol12-no1-art3","status":"publish","type":"page","link":"https:\/\/sbia.org.br\/lnlm\/publicacoes\/vol12-no1\/vol12-no1-art3\/","title":{"rendered":"Using Clustering and Text Mining to Create a Reference Price Database"},"content":{"rendered":"<p><strong>T\u00edtulo:<\/strong> Using Clustering and Text Mining to Create a Reference Price Database<\/p>\n<p><strong>Autores:<\/strong> Carvalho, Rommel; Paiva, Eduardo de; Rocha, Henrique da; Mendes, Gilson<\/p>\n<p align=\"justify\"><strong>Resumo:<\/strong> Since 2004, Brazil`s Office of the Comptroller General (CGU) has been publishing several data related to government expenditures in the Transparency Portal. In 2010, CGU started publishing daily every financial statement produced by the Federal Government. Nevertheless, inconsistencies which hinder accountability have been found in this data base. This paper presents how CGU uses clustering and text mining techniques to retrieve essential information for a good accountability, which includes what was bought, the price paid per item, a price reference per product, etc. This analysis has allowed CGU to draw some preliminary conclusions which are presented as a means to illustrate the research results. Finally, this information will eventually be incorporated in the Transparency Portal, allowing every citizen to understand how much the Government is really paying, in general, for products. Thus, improving social control and providing a solid accountability not only to CGU, as an internal control agency, but also to Brazil`s citizens who, in the end, are the ones paying the bill.<\/p>\n<p><strong>Palavras-chave:<\/strong> Reference price; cluster; text mining; public expenditure; accountability; note of purchase; government purchase<\/p>\n<p><strong>P\u00e1ginas:<\/strong> 15<\/p>\n<p><strong>C\u00f3digo DOI:<\/strong> <a href=\"http:\/\/dx.doi.org\/10.21528\/lnlm-vol12-no1-art3\">10.21528\/lmln-vol12-no1-art3<\/a><\/p>\n<p><strong>Artigo em PDF:<\/strong> <a href=\"https:\/\/sbia.org.br\/lnlm\/wp-content\/uploads\/sites\/4\/2016\/07\/vol12-no1-art3.pdf\" rel=\"\">vol12-no1-art3.pdf<\/a><\/p>\n<p><strong>Arquivo BibTex:<\/strong> <a href=\"https:\/\/sbia.org.br\/lnlm\/wp-content\/uploads\/sites\/4\/2016\/07\/vol12-no1-art3.bib\" rel=\"\">vol12-no1-art3.bib<\/a><\/p>\n","protected":false},"excerpt":{"rendered":"<p>T\u00edtulo: Using Clustering and Text Mining to Create a Reference Price Database Autores: Carvalho, Rommel; Paiva, Eduardo de; Rocha, Henrique da; Mendes, Gilson Resumo: Since 2004, Brazil`s Office of the Comptroller General (CGU) has been publishing several data related to <a href=\"https:\/\/sbia.org.br\/lnlm\/publicacoes\/vol12-no1\/vol12-no1-art3\/\" class=\"read-more\">Read More &#8230;<\/a><\/p>\n","protected":false},"author":1,"featured_media":0,"parent":656,"menu_order":0,"comment_status":"closed","ping_status":"closed","template":"","meta":{"footnotes":""},"class_list":["post-662","page","type-page","status-publish","hentry"],"yoast_head":"<!-- This site is optimized with the Yoast SEO plugin v26.9 - https:\/\/yoast.com\/product\/yoast-seo-wordpress\/ -->\n<title>Using Clustering and Text Mining to Create a Reference Price Database - Learning and NonLinear Models<\/title>\n<meta name=\"robots\" content=\"index, follow, max-snippet:-1, max-image-preview:large, max-video-preview:-1\" \/>\n<link rel=\"canonical\" href=\"https:\/\/sbia.org.br\/lnlm\/publicacoes\/vol12-no1\/vol12-no1-art3\/\" \/>\n<meta property=\"og:locale\" content=\"pt_BR\" \/>\n<meta property=\"og:type\" content=\"article\" \/>\n<meta property=\"og:title\" content=\"Using Clustering and Text Mining to Create a Reference Price Database - Learning and NonLinear Models\" \/>\n<meta property=\"og:description\" content=\"T\u00edtulo: Using Clustering and Text Mining to Create a Reference Price Database Autores: Carvalho, Rommel; Paiva, Eduardo de; Rocha, Henrique da; Mendes, Gilson Resumo: Since 2004, Brazil`s Office of the Comptroller General (CGU) has been publishing several data related to Read More ...\" \/>\n<meta property=\"og:url\" content=\"https:\/\/sbia.org.br\/lnlm\/publicacoes\/vol12-no1\/vol12-no1-art3\/\" \/>\n<meta property=\"og:site_name\" content=\"Learning and NonLinear Models\" \/>\n<meta name=\"twitter:card\" content=\"summary_large_image\" \/>\n<meta name=\"twitter:label1\" content=\"Est. tempo de leitura\" \/>\n\t<meta name=\"twitter:data1\" content=\"1 minuto\" \/>\n<script type=\"application\/ld+json\" class=\"yoast-schema-graph\">{\"@context\":\"https:\/\/schema.org\",\"@graph\":[{\"@type\":\"WebPage\",\"@id\":\"https:\/\/sbia.org.br\/lnlm\/publicacoes\/vol12-no1\/vol12-no1-art3\/\",\"url\":\"https:\/\/sbia.org.br\/lnlm\/publicacoes\/vol12-no1\/vol12-no1-art3\/\",\"name\":\"Using Clustering and Text Mining to Create a Reference Price Database - Learning and NonLinear Models\",\"isPartOf\":{\"@id\":\"https:\/\/sbia.org.br\/lnlm\/#website\"},\"datePublished\":\"2016-07-20T16:53:28+00:00\",\"breadcrumb\":{\"@id\":\"https:\/\/sbia.org.br\/lnlm\/publicacoes\/vol12-no1\/vol12-no1-art3\/#breadcrumb\"},\"inLanguage\":\"pt-BR\",\"potentialAction\":[{\"@type\":\"ReadAction\",\"target\":[\"https:\/\/sbia.org.br\/lnlm\/publicacoes\/vol12-no1\/vol12-no1-art3\/\"]}]},{\"@type\":\"BreadcrumbList\",\"@id\":\"https:\/\/sbia.org.br\/lnlm\/publicacoes\/vol12-no1\/vol12-no1-art3\/#breadcrumb\",\"itemListElement\":[{\"@type\":\"ListItem\",\"position\":1,\"name\":\"Browse issues\",\"item\":\"https:\/\/sbia.org.br\/lnlm\/publicacoes\/\"},{\"@type\":\"ListItem\",\"position\":2,\"name\":\"Learning &#038; Nonlinear Models &#8211; L&#038;NLM &#8211; Volume 12 &#8211; N\u00famero 1\",\"item\":\"https:\/\/sbia.org.br\/lnlm\/publicacoes\/vol12-no1\/\"},{\"@type\":\"ListItem\",\"position\":3,\"name\":\"Using Clustering and Text Mining to Create a Reference Price Database\"}]},{\"@type\":\"WebSite\",\"@id\":\"https:\/\/sbia.org.br\/lnlm\/#website\",\"url\":\"https:\/\/sbia.org.br\/lnlm\/\",\"name\":\"Learning and NonLinear Models\",\"description\":\"\",\"publisher\":{\"@id\":\"https:\/\/sbia.org.br\/lnlm\/#organization\"},\"potentialAction\":[{\"@type\":\"SearchAction\",\"target\":{\"@type\":\"EntryPoint\",\"urlTemplate\":\"https:\/\/sbia.org.br\/lnlm\/?s={search_term_string}\"},\"query-input\":{\"@type\":\"PropertyValueSpecification\",\"valueRequired\":true,\"valueName\":\"search_term_string\"}}],\"inLanguage\":\"pt-BR\"},{\"@type\":\"Organization\",\"@id\":\"https:\/\/sbia.org.br\/lnlm\/#organization\",\"name\":\"Learning and NonLinear Models\",\"url\":\"https:\/\/sbia.org.br\/lnlm\/\",\"logo\":{\"@type\":\"ImageObject\",\"inLanguage\":\"pt-BR\",\"@id\":\"https:\/\/sbia.org.br\/lnlm\/#\/schema\/logo\/image\/\",\"url\":\"https:\/\/sbia.org.br\/lnlm\/wp-content\/uploads\/2021\/07\/logo-lnlm.png\",\"contentUrl\":\"https:\/\/sbia.org.br\/lnlm\/wp-content\/uploads\/2021\/07\/logo-lnlm.png\",\"width\":398,\"height\":94,\"caption\":\"Learning and NonLinear Models\"},\"image\":{\"@id\":\"https:\/\/sbia.org.br\/lnlm\/#\/schema\/logo\/image\/\"}}]}<\/script>\n<!-- \/ Yoast SEO plugin. -->","yoast_head_json":{"title":"Using Clustering and Text Mining to Create a Reference Price Database - Learning and NonLinear Models","robots":{"index":"index","follow":"follow","max-snippet":"max-snippet:-1","max-image-preview":"max-image-preview:large","max-video-preview":"max-video-preview:-1"},"canonical":"https:\/\/sbia.org.br\/lnlm\/publicacoes\/vol12-no1\/vol12-no1-art3\/","og_locale":"pt_BR","og_type":"article","og_title":"Using Clustering and Text Mining to Create a Reference Price Database - Learning and NonLinear Models","og_description":"T\u00edtulo: Using Clustering and Text Mining to Create a Reference Price Database Autores: Carvalho, Rommel; Paiva, Eduardo de; Rocha, Henrique da; Mendes, Gilson Resumo: Since 2004, Brazil`s Office of the Comptroller General (CGU) has been publishing several data related to Read More ...","og_url":"https:\/\/sbia.org.br\/lnlm\/publicacoes\/vol12-no1\/vol12-no1-art3\/","og_site_name":"Learning and NonLinear Models","twitter_card":"summary_large_image","twitter_misc":{"Est. tempo de leitura":"1 minuto"},"schema":{"@context":"https:\/\/schema.org","@graph":[{"@type":"WebPage","@id":"https:\/\/sbia.org.br\/lnlm\/publicacoes\/vol12-no1\/vol12-no1-art3\/","url":"https:\/\/sbia.org.br\/lnlm\/publicacoes\/vol12-no1\/vol12-no1-art3\/","name":"Using Clustering and Text Mining to Create a Reference Price Database - Learning and NonLinear Models","isPartOf":{"@id":"https:\/\/sbia.org.br\/lnlm\/#website"},"datePublished":"2016-07-20T16:53:28+00:00","breadcrumb":{"@id":"https:\/\/sbia.org.br\/lnlm\/publicacoes\/vol12-no1\/vol12-no1-art3\/#breadcrumb"},"inLanguage":"pt-BR","potentialAction":[{"@type":"ReadAction","target":["https:\/\/sbia.org.br\/lnlm\/publicacoes\/vol12-no1\/vol12-no1-art3\/"]}]},{"@type":"BreadcrumbList","@id":"https:\/\/sbia.org.br\/lnlm\/publicacoes\/vol12-no1\/vol12-no1-art3\/#breadcrumb","itemListElement":[{"@type":"ListItem","position":1,"name":"Browse issues","item":"https:\/\/sbia.org.br\/lnlm\/publicacoes\/"},{"@type":"ListItem","position":2,"name":"Learning &#038; Nonlinear Models &#8211; L&#038;NLM &#8211; Volume 12 &#8211; N\u00famero 1","item":"https:\/\/sbia.org.br\/lnlm\/publicacoes\/vol12-no1\/"},{"@type":"ListItem","position":3,"name":"Using Clustering and Text Mining to Create a Reference Price Database"}]},{"@type":"WebSite","@id":"https:\/\/sbia.org.br\/lnlm\/#website","url":"https:\/\/sbia.org.br\/lnlm\/","name":"Learning and NonLinear Models","description":"","publisher":{"@id":"https:\/\/sbia.org.br\/lnlm\/#organization"},"potentialAction":[{"@type":"SearchAction","target":{"@type":"EntryPoint","urlTemplate":"https:\/\/sbia.org.br\/lnlm\/?s={search_term_string}"},"query-input":{"@type":"PropertyValueSpecification","valueRequired":true,"valueName":"search_term_string"}}],"inLanguage":"pt-BR"},{"@type":"Organization","@id":"https:\/\/sbia.org.br\/lnlm\/#organization","name":"Learning and NonLinear Models","url":"https:\/\/sbia.org.br\/lnlm\/","logo":{"@type":"ImageObject","inLanguage":"pt-BR","@id":"https:\/\/sbia.org.br\/lnlm\/#\/schema\/logo\/image\/","url":"https:\/\/sbia.org.br\/lnlm\/wp-content\/uploads\/2021\/07\/logo-lnlm.png","contentUrl":"https:\/\/sbia.org.br\/lnlm\/wp-content\/uploads\/2021\/07\/logo-lnlm.png","width":398,"height":94,"caption":"Learning and NonLinear Models"},"image":{"@id":"https:\/\/sbia.org.br\/lnlm\/#\/schema\/logo\/image\/"}}]}},"_links":{"self":[{"href":"https:\/\/sbia.org.br\/lnlm\/wp-json\/wp\/v2\/pages\/662","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/sbia.org.br\/lnlm\/wp-json\/wp\/v2\/pages"}],"about":[{"href":"https:\/\/sbia.org.br\/lnlm\/wp-json\/wp\/v2\/types\/page"}],"author":[{"embeddable":true,"href":"https:\/\/sbia.org.br\/lnlm\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/sbia.org.br\/lnlm\/wp-json\/wp\/v2\/comments?post=662"}],"version-history":[{"count":0,"href":"https:\/\/sbia.org.br\/lnlm\/wp-json\/wp\/v2\/pages\/662\/revisions"}],"up":[{"embeddable":true,"href":"https:\/\/sbia.org.br\/lnlm\/wp-json\/wp\/v2\/pages\/656"}],"wp:attachment":[{"href":"https:\/\/sbia.org.br\/lnlm\/wp-json\/wp\/v2\/media?parent=662"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}