Snajder, J.; Almic, P.: Modeling semantic compositionality of Croatian multiword expressions (2015)
0.00
0.0027115175 = product of:
0.024403658 = sum of:
0.024403658 = weight(_text_:data in 2920) [ClassicSimilarity], result of:
0.024403658 = score(doc=2920,freq=2.0), product of:
0.11642061 = queryWeight, product of:
3.1620505 = idf(docFreq=5088, maxDocs=44218)
0.036818076 = queryNorm
0.2096163 = fieldWeight in 2920, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
3.1620505 = idf(docFreq=5088, maxDocs=44218)
0.046875 = fieldNorm(doc=2920)
0.11111111 = coord(1/9)
- Content
- Vgl. unter: http://takelab.fer.hr/data/cromwesc/. The dataset is available from here: TakeLab-CroMWEsc.tar.gz. The archive contains one file, which contains a list of 200 Croatian multiword expressions annotated with semantic compositionality scores. Twenty expressions were annotated by 24 annotators (denoted by "*") and the rest of them were annotated by 6 annotators. Besides median, we provide mode, mean, and standard deviation for each expression. Consult the above mentioned paper for details.