Saturday, July 22, 2006

 

I wrote a parser for the RTE 2 dataset (in Python)

We can now parse the RTE 2 dataset.

That is, for each T-H (Text-Hypothesis) pair we now have either:

(raw data)
THPair:
- id
- entailment: TRUE/FALSE
- task: QA/SUM/IE/IR
- text: "... ... ... ... .."
- hypothesis: ".. .. ..."

or

(preprocessed data)
XTHPair:
- id
- entailment: TRUE/FALSE
- task: QA/SUM/IE/IR
- text:
- hypothesis:
-------------------

XSentence:
- serial (i.e., the sentence's index within its context)
- nodes:
-------------------

XNode:
- id (unique within the sentence)
- word
- lemma
- pos
- relation:

Some of the structures have an 'X' prefix. This indicates that the structure was built from the preprocessed dataset.
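
For the raw side, a minimal sketch of how THPair records might be read is given here. It assumes the raw file is XML in which each pair is a `pair` element with `id`, `entailment` and `task` attributes and `t` / `h` child elements; those element names, the `parse_pairs` function and the sample string are illustrative assumptions, not the actual parser code.

```python
import xml.etree.ElementTree as ET
from dataclasses import dataclass


@dataclass
class THPair:
    id: str
    entailment: bool
    task: str          # QA / SUM / IE / IR
    text: str
    hypothesis: str


def parse_pairs(xml_string):
    """Build THPair objects from an XML string.

    The element and attribute names used here are assumptions
    about the raw file layout.
    """
    root = ET.fromstring(xml_string)
    pairs = []
    for elem in root.iter("pair"):
        label = elem.get("entailment", "").upper()
        pairs.append(THPair(
            id=elem.get("id"),
            # Accept either TRUE/FALSE or YES/NO style labels.
            entailment=label in ("TRUE", "YES"),
            task=elem.get("task"),
            text=(elem.findtext("t") or "").strip(),
            hypothesis=(elem.findtext("h") or "").strip(),
        ))
    return pairs


# Made-up sample, only to exercise the function.
sample = """<entailment-corpus>
  <pair id="1" entailment="TRUE" task="IE">
    <t>A cat sat on the mat.</t>
    <h>A cat sat.</h>
  </pair>
</entailment-corpus>"""

pairs = parse_pairs(sample)
```

Storing `entailment` as a boolean rather than the original string keeps later comparisons simple, at the cost of losing the exact label spelling.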

The 'parent node' stored in the relation lets us represent each sentence as a parse tree.
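
As a rough sketch of that idea, the preprocessed structures could be modelled as below. Splitting the relation into a label (`relation`) and a parent id (`parent`) is an assumption made for illustration, as are the `children_map` and `render` helpers and the two-word example sentence.

```python
from dataclasses import dataclass, field
from typing import List, Optional


@dataclass
class XNode:
    id: str                          # unique within the sentence
    word: str
    lemma: str
    pos: str
    relation: Optional[str] = None   # label of the edge to the parent (assumed)
    parent: Optional[str] = None     # id of the parent node; None for the root


@dataclass
class XSentence:
    serial: int                      # index within the context
    nodes: List[XNode] = field(default_factory=list)

    def children_map(self):
        """Return (root ids, {node id: [child ids]}) from the parent links."""
        children = {node.id: [] for node in self.nodes}
        roots = []
        for node in self.nodes:
            if node.parent is None or node.parent not in children:
                roots.append(node.id)
            else:
                children[node.parent].append(node.id)
        return roots, children


def render(sentence):
    """Return the tree as indented lines, one word per line."""
    roots, children = sentence.children_map()
    by_id = {node.id: node for node in sentence.nodes}
    lines = []

    def walk(node_id, depth):
        lines.append("  " * depth + by_id[node_id].word)
        for child_id in children[node_id]:
            walk(child_id, depth + 1)

    for root_id in roots:
        walk(root_id, 0)
    return lines


# Made-up example: "cat" depends on "sat".
sentence = XSentence(serial=0, nodes=[
    XNode("1", "cat", "cat", "N", relation="subj", parent="2"),
    XNode("2", "sat", "sit", "V"),
])
tree_lines = render(sentence)
```

Here `tree_lines` holds the root word first, followed by its dependents indented one level deeper.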
