> what does the cluster score represent? and Is there a normalized version from
> the score? if not how can I normalize it?
Each algorithm has its own notion of a cluster's score, so it does not
have any natural representation. You can assume a higher score
indicates higher reliability in the cluster's content, but this is
only partially accurate (for example, the score incorporates the
cluster's size in STC).
Cluster scores need not be normalized, although for STC and Lingo I
think they are (would have to check the code). You can always do
post-processing normalization (min-max).
> Can I get what is called the label score? if yes, how? what does it represents?
Cluster score and label score are equivalent. I don't know what you mean.
> how can I set the Label_Similarity_Threshold LINGO parameter?
For setting attributes see UsingAttributes.java example:
But I don't know which attribute you have in mind -- can you point at
the source code or documentation where you encountered this attribute?
I guess you meant an attribute that was there in the original algorithm (described in the paper and available in Carrot2 2.x). Lingo has been updated since that time, and that attribute got replaced with a number of others. Take a look at this thread for more information: