measuring similarity between documents using jaccard coefficient

Posted by jaskirat on Stack Overflow See other posts from Stack Overflow or by jaskirat
Published on 2010-05-19T20:17:06Z Indexed on 2010/05/19 20:20 UTC
Read the original article Hit count: 285

Filed under:
|
|

hi i m finding similarity between documents ....nd to measure that i used jaccard coefficient...i did like dis

D1=(8,0,0,1) where 8,0,0,1 are the tf-idf scores of the terms t1, t2, t3 , t4

D2=(7,0,0,0)

jaccard coefficient= dotproduct(d1,d2) / |d1|+|d2|-dotproduct(d1,d2)

and the answer comes out to be " -1.367931 "...what does it signify about the similarity between the documents...pls do reply..please...thank u..

© Stack Overflow or respective owner

Related posts about document

Related posts about similarity