About: Latent Dirichlet allocation

Facets (new session)
Description
Metadata
Settings
- Rule:
- Inverse Functional Properties:
- "Same As":

About: Latent Dirichlet allocation Goto Sponge NotDistinct Permalink

An Entity of Type : yago:WikicatProbabilisticModels, within Data Space : dbpedia.demo.openlinksw.com associated with source document(s)
QRcode icon

http://dbpedia.demo.openlinksw.com/c/D8fsHqusE

In natural language processing, Latent Dirichlet Allocation (LDA) is a generative statistical model that explains a set of observations through unobserved groups, and each group explains why some parts of the data are similar. The LDA is an example of a topic model. In this, observations (e.g., words) are collected into documents, and each word's presence is attributable to one of the document's topics. Each document will contain a small number of topics.

Attributes	Values
rdf:type	Thing person yago:WikicatLatentVariableModels yago:WikicatStatisticalModels yago:Assistant109815790 yago:CausalAgent100007347 yago:LivingThing100004258 yago:Model110324560 yago:Object100002684 yago:Organism100004475 yago:Person100007846 yago:PhysicalEntity100001930 yago:Worker109632518 yago:YagoLegalActor yago:YagoLegalActorGeo yago:Whole100003553 yago:WikicatProbabilisticModels
rdfs:label	Assignació Latent de Dirichlet (ca) Latent Dirichlet Allocation (de) Latent Dirichlet Allocation (es) Allocation de Dirichlet latente (fr) Latent Dirichlet allocation (en) 잠재 디리클레 할당 (ko) Alocação latente de Dirichlet (pt) Латентное размещение Дирихле (ru) 隐含狄利克雷分布 (zh)
rdfs:comment	En el processament de llenguatge natural, l'Assignació Latent de Dirichlet (LDA, de l'anglès Latent Dirichlet Allocation) és un modelatge de temes que permet analitzar els temes dels que tracten diferents textos. Es considera que cada text és una barreja d'un nombre reduït de temes, i que la presència de cada paraula al text és atribuïble a un dels temes del document. (ca) Latent Dirichlet allocation (LDA) ist ein von David Blei, Andrew Ng und Michael I. Jordan im Jahre 2003 vorgestelltes generatives Wahrscheinlichkeitsmodell für „Dokumente“. Das Modell ist identisch zu einem 2000 publizierten Modell zur Genanalyse von , und P. Donnelly. Dokumente sind in diesem Fall gruppierte, diskrete und ungeordnete Beobachtungen (im Folgenden „Wörter“ genannt). In den meisten Fällen werden Textdokumente verarbeitet, in denen Wörter gruppiert werden, wobei die Wortreihenfolge keine Rolle spielt. Es können aber auch z. B. Pixel aus Bildern verarbeitet werden. (de) En aprendizaje automático, la Asignación Latente de Dirichlet (ALD) o Latent Dirichlet Allocation (LDA) es un modelo generativo que permite que conjuntos de observaciones puedan ser explicados por grupos que explican porqué algunas partes de los datos son similares. Por ejemplo, si las observaciones son palabras en documentos, presupone que cada documento es una mezcla de un pequeño número de categorías (también denominados como tópicos) y la aparición de cada palabra en un documento se debe a una de las categorías a las que el documento pertenece. LDA es un ejemplo de y fue presentado como un modelo en grafo para descubrir categorías por David Blei, Andrew Ng y Michael Jordan en 2002. (es) Dans le domaine du traitement automatique des langues, l’allocation de Dirichlet latente (de l’anglais Latent Dirichlet Allocation) ou LDA est un modèle génératif probabiliste permettant d’expliquer des ensembles d’observations, par le moyen de groupes non observés, eux-mêmes définis par des similarités de données. (fr) In natural language processing, Latent Dirichlet Allocation (LDA) is a generative statistical model that explains a set of observations through unobserved groups, and each group explains why some parts of the data are similar. The LDA is an example of a topic model. In this, observations (e.g., words) are collected into documents, and each word's presence is attributable to one of the document's topics. Each document will contain a small number of topics. (en) 자연어 처리에서 잠재 디리클레 할당(Latent Dirichlet allocation, LDA)은 주어진 문서에 대하여 각 문서에 어떤 주제들이 존재하는지를 서술하는 대한 확률적 토픽 모델 기법 중 하나이다. 미리 알고 있는 주제별 단어수 분포를 바탕으로, 주어진 문서에서 발견된 단어수 분포를 분석함으로써 해당 문서가 어떤 주제들을 함께 다루고 있을지를 예측할 수 있다. (ko) No processamento de linguagem natural, a alocação latente de Dirichlet (LDA) é um modelo estatístico generativo. Ele permite que conjuntos de observações sejam explicados por variáveis latentes que explicam por que algumas partes dos dados são semelhantes. Por exemplo, se as observações são palavras coletadas em documentos, ele postula que cada documento é uma mistura de um pequeno número de tópicos e que a presença de cada palavra é atribuível a um dos tópicos do documento. O LDA é um exemplo de modelo de tópicos e pertence às ferramentas principais do campo do aprendizado de máquinas e, em sentido mais amplo, às ferramentas de inteligência artificial. (pt) 隐含狄利克雷分布（英語：Latent Dirichlet allocation，简称LDA），是一种主题模型，它可以将文档集中每篇文档的主题按照概率分布的形式给出。同时它是一种无监督学习算法，在训练时不需要手工标注的训练集，需要的仅仅是文档集以及指定主题的数量k即可。此外LDA的另一个优点则是，对于每一个主题均可找出一些词语来描述它。 LDA首先由 David M. Blei、吴恩达和迈克尔·I·乔丹于2003年提出，目前在文本挖掘领域包括文本主题识别、文本分类以及文本相似度计算方面都有应用。 (zh) Латентное размещение Дирихле (LDA, от англ. Latent Dirichlet allocation) — применяемая в машинном обучении и информационном поиске , позволяющая объяснять результаты наблюдений с помощью неявных групп, благодаря чему возможно выявление причин сходства некоторых частей данных. Например, если наблюдениями являются слова, собранные в документы, утверждается, что каждый документ представляет собой смесь небольшого количества тем и что появление каждого слова связано с одной из тем документа. LDA является одним из методов тематического моделирования и впервые был представлен в качестве графовой модели для обнаружения тематик Дэвидом Блеем, Эндрю Ыном и Майклом Джорданом в 2003 году. (ru)
differentFrom	Linear discriminant analysis
rdfs:seeAlso	Dirichlet-multinomial distribution
foaf:depiction
dct:subject	Probabilistic models Latent variable models Statistical natural language processing
Wikipage page ID	4605351 (xsd:integer)
Wikipage revision ID	1109991465 (xsd:integer)
Link from a Wikipage to another Wikipage	Probabilistic models Probabilistic graphical model Multinomial distribution Natural language processing Non-negative matrix factorization Probabilistic latent semantic analysis Bayesian network Peter Donnelly David Blei Independent component analysis Information retrieval Confounding Gamma function Generative model Gensim Statistical inference Andrew Ng Apache Spark Machine learning Chinese restaurant process Stop words Latent variable models Computational musicology Population genetics Probabilistic latent semantic indexing MapReduce Statistical natural language processing Jonathan K. Pritchard K-means clustering NumPy Dirichlet-multinomial distribution Dirichlet distribution Hadoop Hierarchical Dirichlet process Latent variable Infer.NET Michael I. Jordan Categorical distribution R (programming language) Matthew Stephens (statistician) Variational Bayesian methods Expectation propagation Latent semantic indexing Observable variable Plate notation Tf-idf Gibbs sampling Pachinko allocation Reversible-jump Markov chain Monte Carlo Topic model Variational Bayes Association studies Bayesian estimator Gamma-Poisson distribution Collapsed Gibbs sampling Posterior distribution

Faceted Search & Find service v1.17_git147 as of Sep 06 2024

Alternative Linked Data Documents: ODE Content Formats:

RDF

ODATA

Microdata

About

OpenLink Virtuoso version 08.03.3331 as of Sep 2 2024, on Linux (x86_64-generic-linux-glibc212), Single-Server Edition (378 GB total memory, 50 GB memory in use)
Data on this page belongs to its respective rights holders.
Virtuoso Faceted Browser Copyright © 2009-2024 OpenLink Software