Tuesday, July 18, 2006
Pascal RTE 1 Used Techniques
- word overlap between T and H, possibly including stemming, lemmatization,
part of speech tagging, and applying statistical word weighting such as idf.
(for example using simple decision tree trained by the development set: accuracy = 56%)
- consider relationships between words that may reflect entailment, based either on statistical methods or WordNet.
- measure degree of match between T and H, based on some distance criteria.
- few systems incorporated some form of "world knowledge."
- some of them applyied a logical prover.
- other techniques used are:
- probabilistic models
- probabilistic Machine Translation models
- supervised learning methods
- logical inference
- various specific scoring mechanisms
For more information:
The PASCAL Recognising Textual Entailment Challenge: http://www.cs.biu.ac.il/~glikmao/rte05/dagan_et_al.pdf
part of speech tagging, and applying statistical word weighting such as idf.
(for example using simple decision tree trained by the development set: accuracy = 56%)
- consider relationships between words that may reflect entailment, based either on statistical methods or WordNet.
- measure degree of match between T and H, based on some distance criteria.
- few systems incorporated some form of "world knowledge."
- some of them applyied a logical prover.
- other techniques used are:
- probabilistic models
- probabilistic Machine Translation models
- supervised learning methods
- logical inference
- various specific scoring mechanisms
For more information:
The PASCAL Recognising Textual Entailment Challenge: http://www.cs.biu.ac.il/~glikmao/rte05/dagan_et_al.pdf