RST Continuity Corpus

Description

The RST Continuity Corpus was developed at Åbo Akademi University and Humboldt-Universität zu Berlin and contains annotations for continuity dimensions added to the RST Discourse Treebank. The RST Discourse Treebank is a collection of English news texts from the Penn Treebank annotated for rhetorical relations under the RST (Rhetorical Structure Theory) framework. In the RST Continuity Corpus, the relations are annotated for the seven continuity dimensions: time, space, reference, action, perspective, modality, and speech act. The relations are also annotated for polarity, order of segments, nuclearity, and context.
Show more

Year of publication

2024

Type of data

Authors

Humboldt-Universität zu Berlin

Markus Egg - Creator

Linguistics Data Consortium - Publisher

Debopam Das Orcid -palvelun logo - Creator

Project

Other information

Fields of science

Languages

Language

Open access

Open

License

Creative Commons Attribution 4.0 International (CC BY 4.0)

Keywords

Subject headings

Temporal coverage

undefined

Related to this research data