Tuesday, July 18, 2006

 

RTE 2 Preprocessed Dataset Format

Text'i cumlelere ayirmak icin MXTERMINATOR kullanilmis.

Her bir cumleyi parse etmek icin de MINIPAR kullanilmis.

Cumlelerdeki her bir "node" (noktalama isaretleri dahil) su attribute'lara sahip:
- node (e.g. 7)
- word (e.g. "declined")
- lemma (e.g. "decline")
- pos (e.g. v)
- relation (e.g. i) (bkz. MINIPAR Grammatical Categories)
- parent node (for the relation) (e.g. E3)

Parent Node iliskisi kullanilarak tree yapisi elde edilebilir.

RTE1 Dataset / Development: http://www.cs.biu.ac.il/~glikmao/dev2.zip
RTE1 Dataset / Test: http://www.cs.biu.ac.il/~glikmao/test.zip
RTE1 Dataset / Annotated: http://www.cs.biu.ac.il/~glikmao/annotated_test.zip

RTE 2 Dataset: http://ir-srv.cs.biu.ac.il:64080/RTE2/PASCAL/index.php?action=download

Comments: Post a Comment



<< Home

This page is powered by Blogger. Isn't yours?