{"id":638,"date":"2016-07-19T17:21:18","date_gmt":"2016-07-19T20:21:18","guid":{"rendered":"https:\/\/sbia.org.br\/lnlm\/?page_id=638"},"modified":"2016-07-19T17:21:18","modified_gmt":"2016-07-19T20:21:18","slug":"vol11-no1-art2","status":"publish","type":"page","link":"https:\/\/sbia.org.br\/lnlm\/publicacoes\/vol11-no1\/vol11-no1-art2\/","title":{"rendered":"Tree-Based Statistical Machine Translation: Experiments with the English and Brazilian Portuguese Pair"},"content":{"rendered":"<p><strong>T\u00edtulo:<\/strong> Tree-Based Statistical Machine Translation: Experiments with the English and Brazilian Portuguese Pair<\/p>\n<p><strong>Autores:<\/strong> Beck, Daniel; Caseli, Helena<\/p>\n<p align=\"justify\"><strong>Resumo:<\/strong> Machine Learning paradigms have dominated recent research in Machine Translation. Current state-of-the-art approaches rely only on statistical methods that gather all necessary knowledge from parallel corpora. However, this lack on explicit linguistic knowledge makes them unable to model some linguistic phenomena. In this work, we focus on models that take into account the syntactic information from the languages involved on the translation process. We follow a novel approach that preprocess parallel corpora using syntactic parsers and uses translation models composed by Tree Transducers. We per- form experiments with English and Brazilian Portuguese, providing the first known results in syntax-based Statistical Machine Translation for this language pair. These results show that this approach is able to better model phenomena like long-distance reordering and give directions to future improvements in building syntax-based translation models for this pair.<\/p>\n<p><strong>Palavras-chave:<\/strong> Statistical Machine Translation; Tree Transducers; Machine Learning<\/p>\n<p><strong>P\u00e1ginas:<\/strong> 15<\/p>\n<p><strong>C\u00f3digo DOI:<\/strong> <a href=\"http:\/\/dx.doi.org\/10.21528\/lnlm-vol11-no1-art2\">10.21528\/lmln-vol11-no1-art2<\/a><\/p>\n<p><strong>Artigo em PDF:<\/strong> <a href=\"https:\/\/sbia.org.br\/lnlm\/wp-content\/uploads\/sites\/4\/2016\/07\/vol11-no1-art2.pdf\" rel=\"\">vol11-no1-art2.pdf<\/a><\/p>\n<p><strong>Arquivo BibTex:<\/strong> <a href=\"https:\/\/sbia.org.br\/lnlm\/wp-content\/uploads\/sites\/4\/2016\/07\/vol11-no1-art2.bib\" rel=\"\">vol11-no1-art2.bib<\/a><\/p>\n","protected":false},"excerpt":{"rendered":"<p>T\u00edtulo: Tree-Based Statistical Machine Translation: Experiments with the English and Brazilian Portuguese Pair Autores: Beck, Daniel; Caseli, Helena Resumo: Machine Learning paradigms have dominated recent research in Machine Translation. Current state-of-the-art approaches rely only on statistical methods that gather all <a href=\"https:\/\/sbia.org.br\/lnlm\/publicacoes\/vol11-no1\/vol11-no1-art2\/\" class=\"read-more\">Read More &#8230;<\/a><\/p>\n","protected":false},"author":1,"featured_media":0,"parent":634,"menu_order":0,"comment_status":"closed","ping_status":"closed","template":"","meta":{"footnotes":""},"class_list":["post-638","page","type-page","status-publish","hentry"],"yoast_head":"<!-- This site is optimized with the Yoast SEO plugin v26.9 - https:\/\/yoast.com\/product\/yoast-seo-wordpress\/ -->\n<title>Tree-Based Statistical Machine Translation: Experiments with the English and Brazilian Portuguese Pair - Learning and NonLinear Models<\/title>\n<meta name=\"robots\" content=\"index, follow, max-snippet:-1, max-image-preview:large, max-video-preview:-1\" \/>\n<link rel=\"canonical\" href=\"https:\/\/sbia.org.br\/lnlm\/publicacoes\/vol11-no1\/vol11-no1-art2\/\" \/>\n<meta property=\"og:locale\" content=\"pt_BR\" \/>\n<meta property=\"og:type\" content=\"article\" \/>\n<meta property=\"og:title\" content=\"Tree-Based Statistical Machine Translation: Experiments with the English and Brazilian Portuguese Pair - Learning and NonLinear Models\" \/>\n<meta property=\"og:description\" content=\"T\u00edtulo: Tree-Based Statistical Machine Translation: Experiments with the English and Brazilian Portuguese Pair Autores: Beck, Daniel; Caseli, Helena Resumo: Machine Learning paradigms have dominated recent research in Machine Translation. Current state-of-the-art approaches rely only on statistical methods that gather all Read More ...\" \/>\n<meta property=\"og:url\" content=\"https:\/\/sbia.org.br\/lnlm\/publicacoes\/vol11-no1\/vol11-no1-art2\/\" \/>\n<meta property=\"og:site_name\" content=\"Learning and NonLinear Models\" \/>\n<meta name=\"twitter:card\" content=\"summary_large_image\" \/>\n<meta name=\"twitter:label1\" content=\"Est. tempo de leitura\" \/>\n\t<meta name=\"twitter:data1\" content=\"1 minuto\" \/>\n<script type=\"application\/ld+json\" class=\"yoast-schema-graph\">{\"@context\":\"https:\/\/schema.org\",\"@graph\":[{\"@type\":\"WebPage\",\"@id\":\"https:\/\/sbia.org.br\/lnlm\/publicacoes\/vol11-no1\/vol11-no1-art2\/\",\"url\":\"https:\/\/sbia.org.br\/lnlm\/publicacoes\/vol11-no1\/vol11-no1-art2\/\",\"name\":\"Tree-Based Statistical Machine Translation: Experiments with the English and Brazilian Portuguese Pair - Learning and NonLinear Models\",\"isPartOf\":{\"@id\":\"https:\/\/sbia.org.br\/lnlm\/#website\"},\"datePublished\":\"2016-07-19T20:21:18+00:00\",\"breadcrumb\":{\"@id\":\"https:\/\/sbia.org.br\/lnlm\/publicacoes\/vol11-no1\/vol11-no1-art2\/#breadcrumb\"},\"inLanguage\":\"pt-BR\",\"potentialAction\":[{\"@type\":\"ReadAction\",\"target\":[\"https:\/\/sbia.org.br\/lnlm\/publicacoes\/vol11-no1\/vol11-no1-art2\/\"]}]},{\"@type\":\"BreadcrumbList\",\"@id\":\"https:\/\/sbia.org.br\/lnlm\/publicacoes\/vol11-no1\/vol11-no1-art2\/#breadcrumb\",\"itemListElement\":[{\"@type\":\"ListItem\",\"position\":1,\"name\":\"Browse issues\",\"item\":\"https:\/\/sbia.org.br\/lnlm\/publicacoes\/\"},{\"@type\":\"ListItem\",\"position\":2,\"name\":\"Learning &#038; Nonlinear Models &#8211; L&#038;NLM &#8211; Volume 11 &#8211; N\u00famero 1\",\"item\":\"https:\/\/sbia.org.br\/lnlm\/publicacoes\/vol11-no1\/\"},{\"@type\":\"ListItem\",\"position\":3,\"name\":\"Tree-Based Statistical Machine Translation: Experiments with the English and Brazilian Portuguese Pair\"}]},{\"@type\":\"WebSite\",\"@id\":\"https:\/\/sbia.org.br\/lnlm\/#website\",\"url\":\"https:\/\/sbia.org.br\/lnlm\/\",\"name\":\"Learning and NonLinear Models\",\"description\":\"\",\"publisher\":{\"@id\":\"https:\/\/sbia.org.br\/lnlm\/#organization\"},\"potentialAction\":[{\"@type\":\"SearchAction\",\"target\":{\"@type\":\"EntryPoint\",\"urlTemplate\":\"https:\/\/sbia.org.br\/lnlm\/?s={search_term_string}\"},\"query-input\":{\"@type\":\"PropertyValueSpecification\",\"valueRequired\":true,\"valueName\":\"search_term_string\"}}],\"inLanguage\":\"pt-BR\"},{\"@type\":\"Organization\",\"@id\":\"https:\/\/sbia.org.br\/lnlm\/#organization\",\"name\":\"Learning and NonLinear Models\",\"url\":\"https:\/\/sbia.org.br\/lnlm\/\",\"logo\":{\"@type\":\"ImageObject\",\"inLanguage\":\"pt-BR\",\"@id\":\"https:\/\/sbia.org.br\/lnlm\/#\/schema\/logo\/image\/\",\"url\":\"https:\/\/sbia.org.br\/lnlm\/wp-content\/uploads\/2021\/07\/logo-lnlm.png\",\"contentUrl\":\"https:\/\/sbia.org.br\/lnlm\/wp-content\/uploads\/2021\/07\/logo-lnlm.png\",\"width\":398,\"height\":94,\"caption\":\"Learning and NonLinear Models\"},\"image\":{\"@id\":\"https:\/\/sbia.org.br\/lnlm\/#\/schema\/logo\/image\/\"}}]}<\/script>\n<!-- \/ Yoast SEO plugin. -->","yoast_head_json":{"title":"Tree-Based Statistical Machine Translation: Experiments with the English and Brazilian Portuguese Pair - Learning and NonLinear Models","robots":{"index":"index","follow":"follow","max-snippet":"max-snippet:-1","max-image-preview":"max-image-preview:large","max-video-preview":"max-video-preview:-1"},"canonical":"https:\/\/sbia.org.br\/lnlm\/publicacoes\/vol11-no1\/vol11-no1-art2\/","og_locale":"pt_BR","og_type":"article","og_title":"Tree-Based Statistical Machine Translation: Experiments with the English and Brazilian Portuguese Pair - Learning and NonLinear Models","og_description":"T\u00edtulo: Tree-Based Statistical Machine Translation: Experiments with the English and Brazilian Portuguese Pair Autores: Beck, Daniel; Caseli, Helena Resumo: Machine Learning paradigms have dominated recent research in Machine Translation. Current state-of-the-art approaches rely only on statistical methods that gather all Read More ...","og_url":"https:\/\/sbia.org.br\/lnlm\/publicacoes\/vol11-no1\/vol11-no1-art2\/","og_site_name":"Learning and NonLinear Models","twitter_card":"summary_large_image","twitter_misc":{"Est. tempo de leitura":"1 minuto"},"schema":{"@context":"https:\/\/schema.org","@graph":[{"@type":"WebPage","@id":"https:\/\/sbia.org.br\/lnlm\/publicacoes\/vol11-no1\/vol11-no1-art2\/","url":"https:\/\/sbia.org.br\/lnlm\/publicacoes\/vol11-no1\/vol11-no1-art2\/","name":"Tree-Based Statistical Machine Translation: Experiments with the English and Brazilian Portuguese Pair - Learning and NonLinear Models","isPartOf":{"@id":"https:\/\/sbia.org.br\/lnlm\/#website"},"datePublished":"2016-07-19T20:21:18+00:00","breadcrumb":{"@id":"https:\/\/sbia.org.br\/lnlm\/publicacoes\/vol11-no1\/vol11-no1-art2\/#breadcrumb"},"inLanguage":"pt-BR","potentialAction":[{"@type":"ReadAction","target":["https:\/\/sbia.org.br\/lnlm\/publicacoes\/vol11-no1\/vol11-no1-art2\/"]}]},{"@type":"BreadcrumbList","@id":"https:\/\/sbia.org.br\/lnlm\/publicacoes\/vol11-no1\/vol11-no1-art2\/#breadcrumb","itemListElement":[{"@type":"ListItem","position":1,"name":"Browse issues","item":"https:\/\/sbia.org.br\/lnlm\/publicacoes\/"},{"@type":"ListItem","position":2,"name":"Learning &#038; Nonlinear Models &#8211; L&#038;NLM &#8211; Volume 11 &#8211; N\u00famero 1","item":"https:\/\/sbia.org.br\/lnlm\/publicacoes\/vol11-no1\/"},{"@type":"ListItem","position":3,"name":"Tree-Based Statistical Machine Translation: Experiments with the English and Brazilian Portuguese Pair"}]},{"@type":"WebSite","@id":"https:\/\/sbia.org.br\/lnlm\/#website","url":"https:\/\/sbia.org.br\/lnlm\/","name":"Learning and NonLinear Models","description":"","publisher":{"@id":"https:\/\/sbia.org.br\/lnlm\/#organization"},"potentialAction":[{"@type":"SearchAction","target":{"@type":"EntryPoint","urlTemplate":"https:\/\/sbia.org.br\/lnlm\/?s={search_term_string}"},"query-input":{"@type":"PropertyValueSpecification","valueRequired":true,"valueName":"search_term_string"}}],"inLanguage":"pt-BR"},{"@type":"Organization","@id":"https:\/\/sbia.org.br\/lnlm\/#organization","name":"Learning and NonLinear Models","url":"https:\/\/sbia.org.br\/lnlm\/","logo":{"@type":"ImageObject","inLanguage":"pt-BR","@id":"https:\/\/sbia.org.br\/lnlm\/#\/schema\/logo\/image\/","url":"https:\/\/sbia.org.br\/lnlm\/wp-content\/uploads\/2021\/07\/logo-lnlm.png","contentUrl":"https:\/\/sbia.org.br\/lnlm\/wp-content\/uploads\/2021\/07\/logo-lnlm.png","width":398,"height":94,"caption":"Learning and NonLinear Models"},"image":{"@id":"https:\/\/sbia.org.br\/lnlm\/#\/schema\/logo\/image\/"}}]}},"_links":{"self":[{"href":"https:\/\/sbia.org.br\/lnlm\/wp-json\/wp\/v2\/pages\/638","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/sbia.org.br\/lnlm\/wp-json\/wp\/v2\/pages"}],"about":[{"href":"https:\/\/sbia.org.br\/lnlm\/wp-json\/wp\/v2\/types\/page"}],"author":[{"embeddable":true,"href":"https:\/\/sbia.org.br\/lnlm\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/sbia.org.br\/lnlm\/wp-json\/wp\/v2\/comments?post=638"}],"version-history":[{"count":0,"href":"https:\/\/sbia.org.br\/lnlm\/wp-json\/wp\/v2\/pages\/638\/revisions"}],"up":[{"embeddable":true,"href":"https:\/\/sbia.org.br\/lnlm\/wp-json\/wp\/v2\/pages\/634"}],"wp:attachment":[{"href":"https:\/\/sbia.org.br\/lnlm\/wp-json\/wp\/v2\/media?parent=638"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}