RT info:eu-repo/semantics/article T1 Pragmatic annotation of a domain-restricted English-Spanish comparable corpus A1 Sanjurjo González, Hugo A1 Ramón García, Noelia A1 Rabadán Álvarez, Rosa A2 Filologia Inglesa K1 Lengua inglesa K1 Semantic annotation K1 Pragmatic annotation K1 Comparable corpus K1 Regular expressions K1 English/Spanish K1 5701.11 Enseñanza de Lenguas AB [EN] This paper explores the multi-layer annotation of a written domain-restricted English-Spanish comparable corpus (CLANES – Controlled LANguage English Spanish), focusing on pragmatic annotation. The annotation scheme draws on part of speech tagging and a semantic annotation scheme, i.e. the UCREL Semantic Analysis System, with some added categories to fit the food-and-drink domain represented in CLANES. These are used to build significant (pragmatic) metapatterns. Seven different pragmatic functions have been identified in our corpus, namely , , , , , and . Computer scripts translate this linguistic information into regular expressions to be used in unsupervised annotation. Partial results indicate that applying lexical restrictors boosts the success rate considerably. However, metadata is preferred because of increased replicability and generality. Replicability issues and limitations encountered during testing are also addressed. PB University of Bergen LK https://hdl.handle.net/10612/20503 UL https://hdl.handle.net/10612/20503 NO Rabadán, R., Ramón, N., & Sanjurjo-González, H. (2021). Pragmatic annotation of a domain-restricted English-Spanish comparable corpus. Bergen Language and Linguistics Studies, 11(1), 209-223. https://doi.org/10.15845/BELLS.V11I1.3445 DS BULERIA. Repositorio Institucional de la Universidad de León RD 03-jun-2024