Measuring similarity between documents using the Jaccard coefficient
Posted by jaskirat on Stack Overflow
Published on 2010-05-19T20:17:06Z
Indexed on 2010/05/19 20:20 UTC
Hi, I am measuring the similarity between documents, and to do that I used the Jaccard coefficient. This is what I did:
D1 = (8, 0, 0, 1), where 8, 0, 0, 1 are the tf-idf scores of the terms t1, t2, t3, t4
D2 = (7, 0, 0, 0)
Jaccard coefficient = dotproduct(D1, D2) / (|D1| + |D2| - dotproduct(D1, D2))
The answer comes out to be -1.367931. What does this signify about the similarity between the documents? Please reply. Thank you.
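For anyone checking the arithmetic, here is a minimal Python sketch, not part of the original question. It reproduces the posted figure, which only comes out when the plain vector lengths |D1| and |D2| are used in the denominator, and compares it with the extended Jaccard (Tanimoto) coefficient for real-valued vectors, which uses the squared lengths instead. The function names are made up for illustration, and reading the posted formula as having the whole right-hand side in the denominator is an assumption based on the number reported.

```python
import math

def dot(a, b):
    # Dot product of two equal-length vectors.
    return sum(x * y for x, y in zip(a, b))

def norm(a):
    # Euclidean length |a|.
    return math.sqrt(dot(a, a))

def formula_as_posted(a, b):
    # Hypothetical name: the formula from the question, with the
    # unsquared lengths |a| + |b| in the denominator (assumed grouping).
    d = dot(a, b)
    return d / (norm(a) + norm(b) - d)

def tanimoto(a, b):
    # Extended Jaccard (Tanimoto) coefficient: squared lengths in the
    # denominator, i.e. a.b / (|a|^2 + |b|^2 - a.b).
    d = dot(a, b)
    return d / (dot(a, a) + dot(b, b) - d)

d1 = (8, 0, 0, 1)
d2 = (7, 0, 0, 0)

print(formula_as_posted(d1, d2))  # about -1.3679, the value in the question
print(tanimoto(d1, d2))           # 56 / 58, about 0.9655
```

With squared lengths the denominator is 65 + 49 - 56 = 58, so the result stays between 0 and 1 for non-negative tf-idf vectors. With unsquared lengths the dot product (56) exceeds |D1| + |D2| (about 15.06), which is why the posted value is negative.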
© Stack Overflow or respective owner