Bitext word alignment or simply word alignment is the natural language processing task of identifying translation relationships among the words (or more rarely multiword units) in a bitext, resulting in a bipartite graph between the two sides of the bitext, with an arc between two words if and only if they are translations of one another. Word alignment is typically done after sentence alignment has already identified pairs of sentences that are translations of one another.
Attributes | Values |
---|
rdf:type
| |
rdfs:label
| - Bitext word alignment (en)
- Dopasowanie wyrazów (pl)
|
rdfs:comment
| - Dopasowanie wyrazów – aspekt tłumaczenia statystycznego, zadanie polegające na łączeniu odpowiadających sobie słów między parą zdań, które stanowią wzajemne tłumaczenie. Teksty mogą być elementami korpusu równoległego, uprzednio dopasowanego na poziomie zdań. Wynikiem procesu jest macierz dopasowania wyrazów o wymiarze gdzie i oznaczają ilości słów w odpowiadających sobie zdaniach. Otrzymana macierz stanowi graficzną reprezentację powiązań między wyrazami zdań. (pl)
- Bitext word alignment or simply word alignment is the natural language processing task of identifying translation relationships among the words (or more rarely multiword units) in a bitext, resulting in a bipartite graph between the two sides of the bitext, with an arc between two words if and only if they are translations of one another. Word alignment is typically done after sentence alignment has already identified pairs of sentences that are translations of one another. (en)
|
foaf:depiction
| |
dcterms:subject
| |
Wikipage page ID
| |
Wikipage revision ID
| |
Link from a Wikipage to another Wikipage
| |
Link from a Wikipage to an external page
| |
sameAs
| |
dbp:wikiPageUsesTemplate
| |
thumbnail
| |
has abstract
| - Bitext word alignment or simply word alignment is the natural language processing task of identifying translation relationships among the words (or more rarely multiword units) in a bitext, resulting in a bipartite graph between the two sides of the bitext, with an arc between two words if and only if they are translations of one another. Word alignment is typically done after sentence alignment has already identified pairs of sentences that are translations of one another. Bitext word alignment is an important supporting task for most methods of statistical machine translation. The parameters of statistical machine translation models are typically estimated by observing word-aligned bitexts, and conversely automatic word alignment is typically done by choosing that alignment which best fits a statistical machine translation model. Circular application of these two ideas results in an instance of the expectation-maximization algorithm. This approach to training is an instance of unsupervised learning, in that the system is not given examples of the kind of output desired, but is trying to find values for the unobserved model and alignments which best explain the observed bitext. Recent work has begun to explore supervised methods which rely on presenting the system with a (usually small) number of manually aligned sentences. In addition to the benefit of the additional information provided by supervision, these models are typically also able to more easily take advantage of combining many features of the data, such as context, syntactic structure, part-of-speech, or translation lexicon information, which are difficult to integrate into the generative statistical models traditionally used. Besides the training of machine translation systems, other applications of word alignment include translation lexicon induction, word sense discovery, word sense disambiguation and the cross-lingual projection of linguistic information. (en)
- Dopasowanie wyrazów – aspekt tłumaczenia statystycznego, zadanie polegające na łączeniu odpowiadających sobie słów między parą zdań, które stanowią wzajemne tłumaczenie. Teksty mogą być elementami korpusu równoległego, uprzednio dopasowanego na poziomie zdań. Wynikiem procesu jest macierz dopasowania wyrazów o wymiarze gdzie i oznaczają ilości słów w odpowiadających sobie zdaniach. Otrzymana macierz stanowi graficzną reprezentację powiązań między wyrazami zdań. (pl)
|
gold:hypernym
| |
prov:wasDerivedFrom
| |
page length (characters) of wiki page
| |
foaf:isPrimaryTopicOf
| |
is Link from a Wikipage to another Wikipage
of | |
is Wikipage redirect
of | |
is foaf:primaryTopic
of | |