Class AxiomaticF2EXP
java.lang.Object
org.apache.lucene.search.similarities.Similarity
org.apache.lucene.search.similarities.SimilarityBase
org.apache.lucene.search.similarities.Axiomatic
org.apache.lucene.search.similarities.AxiomaticF2EXP
F2EXP is defined as Sum(tfln(term_doc_freq, docLen)*IDF(term)) where IDF(t) = pow((N+1)/df(t), k)
 N=total num of docs, df=doc freq
- WARNING: This API is experimental and might change in incompatible ways in the next release.
- 
Nested Class SummaryNested classes/interfaces inherited from class org.apache.lucene.search.similarities.SimilaritySimilarity.SimScorer
- 
Field Summary
- 
Constructor SummaryConstructorsConstructorDescriptionDefault constructorAxiomaticF2EXP(float s) Constructor setting s only, letting k and queryLen to defaultAxiomaticF2EXP(float s, float k) Constructor setting s and k, letting queryLen to default
- 
Method SummaryModifier and TypeMethodDescriptionprotected doublegamma(BasicStats stats, double freq, double docLen) compute the gamma componentprotected doubleidf(BasicStats stats, double freq, double docLen) compute the inverted document frequency componentprotected ExplanationidfExplain(BasicStats stats, double freq, double docLen) Explain the score of the inverted document frequency component for a single documentprotected doubleln(BasicStats stats, double freq, double docLen) compute the document length componentprotected ExplanationlnExplain(BasicStats stats, double freq, double docLen) Explain the score of the document length component for a single documentprotected doubletf(BasicStats stats, double freq, double docLen) compute the term frequency componentprotected ExplanationtfExplain(BasicStats stats, double freq, double docLen) Explain the score of the term frequency component for a single documentprotected doubletfln(BasicStats stats, double freq, double docLen) compute the mixed term frequency and document length componentprotected ExplanationtflnExplain(BasicStats stats, double freq, double docLen) Explain the score of the mixed term frequency and document length component for a single documenttoString()Name of the axiomatic method.Methods inherited from class org.apache.lucene.search.similarities.Axiomaticexplain, explain, scoreMethods inherited from class org.apache.lucene.search.similarities.SimilarityBasefillBasicStats, log2, newStats, scorerMethods inherited from class org.apache.lucene.search.similarities.SimilaritycomputeNorm, getDiscountOverlaps
- 
Constructor Details- 
AxiomaticF2EXPpublic AxiomaticF2EXP(float s, float k) Constructor setting s and k, letting queryLen to default- Parameters:
- s- hyperparam for the growth function
- k- hyperparam for the primitive weighting function
 
- 
AxiomaticF2EXPpublic AxiomaticF2EXP(float s) Constructor setting s only, letting k and queryLen to default- Parameters:
- s- hyperparam for the growth function
 
- 
AxiomaticF2EXPpublic AxiomaticF2EXP()Default constructor
 
- 
- 
Method Details- 
toStringDescription copied from class:AxiomaticName of the axiomatic method.
- 
tfcompute the term frequency component
- 
lncompute the document length component
- 
tflncompute the mixed term frequency and document length component
- 
idfcompute the inverted document frequency component
- 
gammacompute the gamma component
- 
tfExplainDescription copied from class:AxiomaticExplain the score of the term frequency component for a single document
- 
lnExplainDescription copied from class:AxiomaticExplain the score of the document length component for a single document
- 
tflnExplainDescription copied from class:AxiomaticExplain the score of the mixed term frequency and document length component for a single document- Specified by:
- tflnExplainin class- Axiomatic
- Parameters:
- stats- the corpus level statistics
- freq- number of occurrences of term in the document
- docLen- the document length
- Returns:
- Explanation of how the tfln component was computed
 
- 
idfExplainDescription copied from class:AxiomaticExplain the score of the inverted document frequency component for a single document- Specified by:
- idfExplainin class- Axiomatic
- Parameters:
- stats- the corpus level statistics
- freq- number of occurrences of term in the document
- docLen- the document length
- Returns:
- Explanation of how the idf component was computed
 
 
-