About: Bitext word alignment

Facets (new session)
Description
Metadata
Settings
- Rule:
- Inverse Functional Properties:
- "Same As":

About: Bitext word alignment Goto Sponge NotDistinct Permalink

An Entity of Type : dbo:Agent, within Data Space : dbpedia.demo.openlinksw.com associated with source document(s)
QRcode icon

http://dbpedia.demo.openlinksw.com/describe/?url=http%3A%2F%2Fdbpedia.org%2Fresource%2FBitext_word_alignment&invfp=IFP_OFF&sas=SAME_AS_OFF

Bitext word alignment or simply word alignment is the natural language processing task of identifying translation relationships among the words (or more rarely multiword units) in a bitext, resulting in a bipartite graph between the two sides of the bitext, with an arc between two words if and only if they are translations of one another. Word alignment is typically done after sentence alignment has already identified pairs of sentences that are translations of one another.

Attributes	Values
rdf:type	agent
rdfs:label	Bitext word alignment (en) Dopasowanie wyrazów (pl)
rdfs:comment	Dopasowanie wyrazów – aspekt tłumaczenia statystycznego, zadanie polegające na łączeniu odpowiadających sobie słów między parą zdań, które stanowią wzajemne tłumaczenie. Teksty mogą być elementami korpusu równoległego, uprzednio dopasowanego na poziomie zdań. Wynikiem procesu jest macierz dopasowania wyrazów o wymiarze gdzie i oznaczają ilości słów w odpowiadających sobie zdaniach. Otrzymana macierz stanowi graficzną reprezentację powiązań między wyrazami zdań. (pl) Bitext word alignment or simply word alignment is the natural language processing task of identifying translation relationships among the words (or more rarely multiword units) in a bitext, resulting in a bipartite graph between the two sides of the bitext, with an arc between two words if and only if they are translations of one another. Word alignment is typically done after sentence alignment has already identified pairs of sentences that are translations of one another. (en)
foaf:depiction
dcterms:subject	Machine translation
Wikipage page ID	4558674 (xsd:integer)
Wikipage revision ID	1083758068 (xsd:integer)
Link from a Wikipage to another Wikipage	Natural language processing Machine translation Bipartite graph Hidden Markov model Expectation-maximization algorithm Word sense disambiguation Statistical machine translation Word sense Expectation–maximization algorithm Forward-backward algorithm Unsupervised learning Translation lexicon Part-of-speech Syntactic structure Bitext Generative statistical model
Link from a Wikipage to an external page	http://linguateca.di.uminho.pt/natools/ http://nlp.cs.nyu.edu/GMA/ http://research.variancia.com/unl-aligner http://www.phontron.com/pialign/ https://anymalign.limsi.fr/ https://elrc-share.eu/repository/browse/hunalign/ade4ee46604c11e9a7e100155d0267060a4253f9b3a54dfe83f98bb9affd85a4 https://jasonriesa.github.io/nile/ https://github.com/mhajiloo/berkeleyaligner https://github.com/moses-smt/giza-pp
sameAs	Bitext word alignment Bitext word alignment Bitext word alignment Bitext word alignment Bitext word alignment
dbp:wikiPageUsesTemplate	dbt:About dbt:Main_article dbt:Short_description
thumbnail	wiki-commons:Special:FilePath/Word_alignment.svg?width=300
has abstract	Bitext word alignment or simply word alignment is the natural language processing task of identifying translation relationships among the words (or more rarely multiword units) in a bitext, resulting in a bipartite graph between the two sides of the bitext, with an arc between two words if and only if they are translations of one another. Word alignment is typically done after sentence alignment has already identified pairs of sentences that are translations of one another. Bitext word alignment is an important supporting task for most methods of statistical machine translation. The parameters of statistical machine translation models are typically estimated by observing word-aligned bitexts, and conversely automatic word alignment is typically done by choosing that alignment which best fits a statistical machine translation model. Circular application of these two ideas results in an instance of the expectation-maximization algorithm. This approach to training is an instance of unsupervised learning, in that the system is not given examples of the kind of output desired, but is trying to find values for the unobserved model and alignments which best explain the observed bitext. Recent work has begun to explore supervised methods which rely on presenting the system with a (usually small) number of manually aligned sentences. In addition to the benefit of the additional information provided by supervision, these models are typically also able to more easily take advantage of combining many features of the data, such as context, syntactic structure, part-of-speech, or translation lexicon information, which are difficult to integrate into the generative statistical models traditionally used. Besides the training of machine translation systems, other applications of word alignment include translation lexicon induction, word sense discovery, word sense disambiguation and the cross-lingual projection of linguistic information. (en) Dopasowanie wyrazów – aspekt tłumaczenia statystycznego, zadanie polegające na łączeniu odpowiadających sobie słów między parą zdań, które stanowią wzajemne tłumaczenie. Teksty mogą być elementami korpusu równoległego, uprzednio dopasowanego na poziomie zdań. Wynikiem procesu jest macierz dopasowania wyrazów o wymiarze gdzie i oznaczają ilości słów w odpowiadających sobie zdaniach. Otrzymana macierz stanowi graficzną reprezentację powiązań między wyrazami zdań. (pl)
gold:hypernym	Task
prov:wasDerivedFrom	wikipedia-en:Bitext_word_alignment?oldid=1083758068&ns=0
page length (characters) of wiki page	6458 (xsd:nonNegativeInteger)
foaf:isPrimaryTopicOf	wikipedia-en:Bitext_word_alignment
is Link from a Wikipage to another Wikipage of	Automatic acquisition of sense-tagged corpora Microsoft Translator Word alignment (linguistics)
is Wikipage redirect of	Word alignment (linguistics)
is foaf:primaryTopic of	wikipedia-en:Bitext_word_alignment

Faceted Search & Find service v1.17_git139 as of Feb 29 2024

Alternative Linked Data Documents: ODE Content Formats:

RDF

ODATA

Microdata

About

OpenLink Virtuoso version 08.03.3330 as of Mar 19 2024, on Linux (x86_64-generic-linux-glibc212), Single-Server Edition (378 GB total memory, 52 GB memory in use)
Data on this page belongs to its respective rights holders.
Virtuoso Faceted Browser Copyright © 2009-2024 OpenLink Software