About: MinHash

Facets (new session)
Description
Metadata
Settings
- Rule:
- Inverse Functional Properties:
- "Same As":

About: MinHash Goto Sponge NotDistinct Permalink

An Entity of Type : yago:WikicatProbabilisticDataStructures, within Data Space : dbpedia.demo.openlinksw.com associated with source document(s)
QRcode icon

http://dbpedia.demo.openlinksw.com/describe/?url=http%3A%2F%2Fdbpedia.org%2Fresource%2FMinHash&invfp=IFP_OFF&sas=SAME_AS_OFF

In computer science and data mining, MinHash (or the min-wise independent permutations locality sensitive hashing scheme) is a technique for quickly estimating how similar two sets are. The scheme was invented by Andrei Broder, and initially used in the AltaVista search engine to detect duplicate web pages and eliminate them from search results. It has also been applied in large-scale clustering problems, such as clustering documents by the similarity of their sets of words.

Attributes	Values
rdf:type	topical concept yago:WikicatClusteringCriteria yago:Abstraction100002137 yago:Arrangement105726596 yago:Cognition100023271 yago:DataStructure105728493 yago:Function113783816 yago:MathematicalRelation113783581 yago:Measure100033615 yago:PsychologicalFeature100023100 yago:Relation100031921 yago:WikicatHashFunctions yago:Standard107260623 yago:Structure105726345 yago:SystemOfMeasurement113577171 yago:WikicatProbabilisticDataStructures
rdfs:label	MinHaketo (eo) MinHash (es) MinHash (en) 最小哈希 (zh)
rdfs:comment	En komputoscienco, MinHaketo (aŭ la mininuma-sepdependa-permuta loko-zorga haketado) estas tekniko por rapide taksi la similecon de du aroj. La teknikon inventis , kaj oni unue uzis por la serĉilo por detekti kaj forigi kopiojn de retpaĝo el la serĉrezulto. Ĝi ankaŭ uziĝas por granda arigado, ekzemple arigi fajloj per simileco de iliaj enhavaj vortoj. (eo) En ciencia de la computacion, MinHash (o el esquema sensible a localidad que trata permutaciones independientes relativos al mínimo) es una técnica para estimar rápidamente cuan similares son dos conjuntos. El esquema fue inventado por Andrei Broder en 1997 , e inicialmente usado en el motor de búsqueda AltaVista para detectar páginas web duplicadas y eliminarlas de los resultados de búsqueda.También ha sido aplicado en problemas de , tales como por la similitud de las palabras que contienen. (es) In computer science and data mining, MinHash (or the min-wise independent permutations locality sensitive hashing scheme) is a technique for quickly estimating how similar two sets are. The scheme was invented by Andrei Broder, and initially used in the AltaVista search engine to detect duplicate web pages and eliminate them from search results. It has also been applied in large-scale clustering problems, such as clustering documents by the similarity of their sets of words. (en) 在计算机科学领域，最小哈希（或最小哈希式独立排列）方法是一种快速判断两个集合是否相似的技术。这种方法是由（）,发明的，最初在AltaVista搜索引擎中用于在搜索结果中检测并消除重复Web页面。它同样也应用于大规模聚类问题，比如通过文档间包含的词语相似性进行聚类。 (zh)
dcterms:subject	Probabilistic data structures Clustering criteria Hashing Hash functions
Wikipage page ID	30632997 (xsd:integer)
Wikipage revision ID	1092792526 (xsd:integer)
Link from a Wikipage to another Wikipage	Nearest neighbor search Cosine distance Bloom filter Probabilistic data structures Bias of an estimator Permutation Variance Inverse transform sampling Piotr Indyk Count–min sketch Google Google News Linear time Cluster analysis Computer science MapReduce Clustering criteria Data mining W-shingling Disjoint sets Hash function K-independent hashing Locality-sensitive hashing Logical matrix AltaVista Euclidean vector Exponential distribution Probability Random variable Hashing Hamming distance Intersection (set theory) Chernoff bound Hash functions Jaccard index RefSeq Document clustering Association rule learning Simple random sample Similarity measure Union (set theory) Random permutation Universal hashing SimHash Tabulation hashing Locality sensitive hashing
sameAs	MinHash MinHash MinHash MinHash MinHash MinHash MinHash MinHash
dbp:wikiPageUsesTemplate	dbt:Harvtxt dbt:Math dbt:Mvar dbt:Reflist dbt:Short_description dbt:Sqrt dbt:Harvs
authorlink	Andrei Broder (en)
first	Andrei (en)
last	Broder (en)
year	1997 (xsd:integer)
has abstract	En komputoscienco, MinHaketo (aŭ la mininuma-sepdependa-permuta loko-zorga haketado) estas tekniko por rapide taksi la similecon de du aroj. La teknikon inventis , kaj oni unue uzis por la serĉilo por detekti kaj forigi kopiojn de retpaĝo el la serĉrezulto. Ĝi ankaŭ uziĝas por granda arigado, ekzemple arigi fajloj per simileco de iliaj enhavaj vortoj. (eo) En ciencia de la computacion, MinHash (o el esquema sensible a localidad que trata permutaciones independientes relativos al mínimo) es una técnica para estimar rápidamente cuan similares son dos conjuntos. El esquema fue inventado por Andrei Broder en 1997 , e inicialmente usado en el motor de búsqueda AltaVista para detectar páginas web duplicadas y eliminarlas de los resultados de búsqueda.También ha sido aplicado en problemas de , tales como por la similitud de las palabras que contienen. (es) In computer science and data mining, MinHash (or the min-wise independent permutations locality sensitive hashing scheme) is a technique for quickly estimating how similar two sets are. The scheme was invented by Andrei Broder, and initially used in the AltaVista search engine to detect duplicate web pages and eliminate them from search results. It has also been applied in large-scale clustering problems, such as clustering documents by the similarity of their sets of words. (en) 在计算机科学领域，最小哈希（或最小哈希式独立排列）方法是一种快速判断两个集合是否相似的技术。这种方法是由（）,发明的，最初在AltaVista搜索引擎中用于在搜索结果中检测并消除重复Web页面。它同样也应用于大规模聚类问题，比如通过文档间包含的词语相似性进行聚类。 (zh)
gold:hypernym	Technique

Faceted Search & Find service v1.17_git139 as of Feb 29 2024

Alternative Linked Data Documents: ODE Content Formats:

RDF

ODATA

Microdata

About

OpenLink Virtuoso version 08.03.3330 as of Mar 19 2024, on Linux (x86_64-generic-linux-glibc212), Single-Server Edition (378 GB total memory, 62 GB memory in use)
Data on this page belongs to its respective rights holders.
Virtuoso Faceted Browser Copyright © 2009-2024 OpenLink Software