It Is MarkIT That Is New: An Italian Treebank of Marked Constructions
p. 258-264
Résumés
In this paper we present MarkIT, a treebank of marked constructions in Italian, containing around 800 sentences with dependency annotation. We detail the process to extract the sentences and manually correct them. The resource covers seven types of marked constructions plus some ambiguous sentences, whose syntax can be wrongly classified as marked. We also present a preliminary evaluation of parsing performance, comparing a model trained on existing Italian treebanks with the model obtained by adding MarkIT to the training set.
In questo lavoro presentiamo MarkIT, un treebank di costruzioni marcate in italiano che contiene circa 800 frasi annotate con strutture a dipendenze. Abbiamo descritto nel dettaglio il processo seguito per estrarre le frasi e correggerne manualmente la struttura sintassi. La risorsa comprende sette tipologie di costruzioni marcate oltre ad alcune costruzioni ambigue che potrebbero essere classificate erroneamente come marcate. Presentiamo inoltre una valutazione preliminare delle performance del parser in cui confrontiamo un modello allenato sui treebank esistenti dell’italiano con il modello ottenuto aggiungendo anche MarkIT.
Texte intégral
1. Introduction
1In recent years, the goal to develop robust frameworks for consistent annotation of syntactic dependencies across different human languages has led to the creation of Universal Dependencies (UD), an initiative covering nearly 200 treebanks in more than 100 languages. Since UD treebanks are then used to train syntactic parsers, it is important that they account for as many phenomena as possible that can be found in a language, and not only for canonical expressions typically written in news. The purpose to encompass the variety of use in the Italian language has been pursued by including different genres in the VIT treebank (Delmonte, Bristot, and Tonelli 2007) and in ParTUT (Sanguinetti and Bosco 2014) and more recently by including syntactically annotated tweets (Cignarella, Bosco, and Rosso 2019; Sanguinetti et al. 2018) in the UD framework. Overall, seven treebanks are listed under the UD initiative for Italian. In this work, we contribute to this effort by presenting a novel treebank including syntactically annotated marked constructions, which we call MarkIT (MARKed structures Italian Treebank). The samples have been extracted from a corpus of students’ essays and to our knowledge represents the first effort to include in UD a repository of marked structures, which are typical of neo-standard language and are therefore more and more frequent in informal settings (D’Achille 2003). The sentences have been first syntactically parsed and then manually corrected, so that we were also able to analyse which kinds of mistakes are typically done by dependency parsers. The dataset is freely available on Github at https://github.com/dhfbk/markit.
2. Related Work
2In the last years, Universal Depedencies (UD) have become the most widely used standard for syntactic annotation (Marneffe and Manning 2008) upon which treebanks for other languages have been built, including Italian. The first one has been the Italian Stanford Dependency Treebank or ISDT (Bosco, Montemagni, and Simi 2013). Other treebanks have been later built with different purposes, covering a rich collection of different usages and genres. In particular, the VIT treebank (Delmonte, Bristot, and Tonelli 2007) is composed of several texts ranging from news to literature, while TWITTIRO (Cignarella, Bosco, and Rosso 2019) and PoSTWITA (Sanguinetti et al. 2018) are two social-media-based treebanks, composed of tweets. These two Twitter-based treebanks represent an important resource in terms of documentation of the usage of non-standard Italian. We address the same topic in the present work, but instead of considering social-media data, we look at more formal writings, and in particular at the use of marked sentence constructions in students’ essays. To our knowledge, a grammatical UD treebank for Italian language does not exist, and also in other languages there are only few examples. A grammatical treebank is a dataset of annotated trees sharing the same type of grammatical constructions, such as the English Pronouns treebank (Munro 2021), which is the most similar resource to ours. It was created to make independent genitive pronoun’s identification more accurate, by annotating only English sentences which display that construction. For what concerns marked structures in Italian, a comparative study on the distribution of the phenomenon of syntactic markedness has been presented in (Pieri, Brunato, and Dell’Orletta 2016), but the different structures were identified using automated tools. Overall, syntactic markedness is a phenomenon poorly analyzed, especially in the field of dependency grammar. However, it is crucial to make parsers more robust to different syntactic structures.
3. Sentence Collection
3Our goal is to build a treebank of marked constructions that reflects actual usage of Italian, in particular of the neo-standard variant (Berruto 2012). We avoid to manually create sentences ourselves, also to increase linguistic variability. Therefore, we resort to a corpus of students’ essays which were collected by Istituto provinciale Trentino per la Ricerca e la Sperimentazione educativa (IPRASE) with the goal to study the evolution of high-school students’ writing skills, taking into account essays spanning 15 years (from 2001 to 2016). In particular, the project tracked the presence of expressions and constructions typical of neo-standard Italian, requiring a pool of expert annotators, i.e. high-school teachers, to manually mark in essays a number of linguistic traits (Sprugnoli et al. 2018; Tonelli et al. 2020). Among others, annotators were asked to mark dislocated sentences, cleft sentences and hanging topics (see details in Section 4). These were first automatically identified through the TINT NLP Suite (Aprosio and Moretti 2018) and then manually revised by annotators to distinguish between the constructions of interest and other types of similar constructions.
4The final corpus contains more than 2,500 essays and almost 1.5 million tokens. We extract around 800 sentences labeled with a marked structure and annotate them at syntactic level. Although the essays cannot be released because of copyright issues, the sentences in isolation, with no additional information related to the authors or the textual context, can be freely distributed.
5The essays were written in a time span of 15 years by different authors and dealing with a number of different topics, which guarantees a high variability of the sentence content and structure. On the other hand, since they were part of a formal students’ examination, they tend to be free from jargon, grammatical errors and abbreviations that may derive from sentences extracted from social media and that may represent an additional challenge for parsers.
4. Marked Structures in IPRASE Corpus
6With marked sentences we refer to those constructions which present a non-canonical order of constituents. In Italian, the canonical order of the syntactic structure is S V+fin V-fin OX, where S is subject, V+fin is a finite verb or an auxiliary verb, V-fin is a non-finite verb, O is the direct object and X other complements (Benincà, G., and L. 1988). Marked structures are instead intended to focus on an element of the sentence, by moving the focalized constituent in a different position from the one it occupies in a canonical sentence. The reason for markedness in Italian can be phonotactic or bound to the whole meaning of the sentence. In syntactical terms, we can say that the marked structures operate a modification in the distribution of topic and comment with respect to the corresponding non marked structure (Cinque 1990). There are seven possible marked structures in Italian: sentences with postverbal subject, sentences with presentative “there”, sentences with left or right dislocation, hanging topic sentences, cleft sentences and pseudo-cleft sentences (Ferrari and Zampese 2016). Among the sentences from the IPRASE corpus originally marked as dislocated, cleft and hanging topic, we were able to find other types of marked structures which had been wrongly identified by the annotators, so that in the end all seven phenomena are present. Below we report a brief description of the main marked structures annotated in our treebank.
4.1 Left Dislocated Sentences
7Left dislocated sentences entail the displacement or anteposition of a specific syntagm to the left of the sentence. The dislocated element connects with the rest of the sentence thanks to an introductory preposition (1) or a pronominal reprise (2), for which a resumptive clitic pronoun pleonastically co-refers to the displaced nominal element (the topic). The clitic reprise is compulsory whether the displaced element was the direct object, as long as it is in the positive form (Benincà, G., and L. 1988).
(1) A questo evento (ci) partecipano soltanto artisti già noti
To this event (clitic) participate only artists already known
(2) Molto meno successo Eminem lo ha avuto quest’anno
Much less success Eminem it has had this year
4.2 Hanging Topic Sentences
8In hanging topic sentences, similarly to left dislocation, the dislocated element is moved to the left, at the beginning of the sentence. However, in this case, the displaced element is isolated at the beginning of the sentence, and it is not syntactically linked to the verb (D’Achille 2003). The main difference between the two structures is when the dislocated element is the direct object. In fact, since direct objects in Italian exclude prepositional government, only the non-clitic reprise allows the distinction between left dislocated sentences and hanging topics. In hanging topic constructions, the isolated element is always deprived of indicators for its syntactic function, and it is typically reprised in the following phrase by different anaphorical expressions such as atonic pronouns, possessive pronouns, adverbs, and by a whole nominal phrase (3). When there is no reprise of the dislocated element in the subsequent sentence, we refer to that as an example of anacoluthon (Ferrari and Zampese 2016).
(3) [...] ma il cervello, senza di esso non siamo niente
But the brain, without it we are nothing
4.3 Right Dislocated Sentences
9Right dislocated sentences operate a topicalization of the comment and, differently from left dislocated structures, the pronominal reprise is not compulsory when the dislocated element is the direct object. Nevertheless, since the non-marked position of the right dislocated elements is still in postverbal position (apart from the subject), it makes the presence of the anticipatory clitic (4) or of the comma (5) compulsory.
(4) Sono sicuro che molti di voi non lo avevano mai fatto, questo ragionamento
I am sure that many of you do not it have never done, this reasoning
(5) Interessante, in questo senso, la riflessione di Paul
Interesting, in this sense, Paul’s thought
4.4 Cleft Sentences
10Cleft sentences are typically composed of a main clause without a subject introduced by the verb ‘to be’ in different forms, followed by the cleft constituent and by a subordinate clause introduced by “che” (that), whose function can be of relative pronoun (6) or relative conjunction. Sometimes, the subordinate clause can be introduced by “a” (to) + a verb in the infinite form (7), if the subject is the element to put into focus (Berruto and Cerruti 2011). Besides the subject, cleft structures can focalize on several constituents, such as the object, prepositional constituents, adverbs and also verbs, especially in the infinitive form (Renzi 2001).
(6) È lo Stato che [...] impone i suoi modelli
It is the State that […] imposes its models
(7) Non è dunque l’ottica dell’utilità e del guadagno a guidare verso la felicità
It is not then the view of utility and profit to guide to happiness
5. MarkIT Annotation
11Marked structures, such as the ones described above, are very difficult to parse, since they belong to non-standard Italian constructions. In order to annotate them syntactically, we therefore need to follow a semi-automatic approach, by analysing them first with a dependency parser and then manually correcting them. The selected marked constructions from the IPRASE corpus were processed with the TINT parsing module (Aprosio and Moretti 2018), which is built following Universal Dependencies guidelines (Marneffe and Manning 2008), and trained on the Italian Stanford Dependency Treebank, ISDT (Bosco, Montemagni, and Simi 2013). ISDT includes mostly standard language with few non-canonical constructions. The dependency trees parsed by TINT are then manually corrected by an expert linguist using the TINTful interface (Frasnelli, Bocchi, and Palmero Aprosio 2021). They are also marked with one of the categories from Table 1.
12Concerning dislocated sentences, the main issue with TINT is that it assigns to the pronoun the role of direct object and treats the dislocated element as the subject, as in the example shown in Figure 6. The sentence was manually corrected by marking the dislocated element with the dislocated relation and the pronoun of reprise with the core argument relation which it represents (obj or subj), as we can see in Fig. 1 and Fig. 2.
13As previously mentioned, hanging topics differ from left dislocated sentences because the element to the left is not syntactically linked to the verb and there is no clitic reprise of the lexical element. Since there is a sort of isolation of the topicalized element, we choose to use the parataxis relation to link it to the head of the sentence, since parataxis is defined as a relation between a word and other elements, without any explicit coordination, subordination, or argument relation with the head word, which is usually the verb. An example is reported in Figure 3.
14As we have seen above, the “che” (that) before a subordinate clause can be a relative pronoun introducing a relative clause or a relative conjunction, followed by a structure whose nature is controversial. A relative clause is an instance of clausal modifier acl, which takes the specific name of acl:relcl, where the noun can be omitted or substituted by a relative pronoun, relative conjunction, or an adverb. As regards cleft sentences, we choose to use the same relation in two different ways, in order to distinguish between the case in which the cleft sentence comprehends a relative clause or an unspecified subordinate clause. When the dislocated element is the subject or the direct object (substituted by a relative pronoun) we use the acl:relcl relation, selecting the role of “che” (see Fig.4). Instead, if there is no dislocation of the subject or the object, we use the acl relation but we do not select the function of “che”. Indeed, “che” is treated as a mere introducer for the subordinate clause with the mark relation (Fig. 5).
15Table 1 shows the eight types of constructions in MarkIT. Beside the four types of marked structures described above, we found several other structures coming from the erroneous identification of cleft sentences and right dislocated sentences in the original IPRASE corpus. “Presentative there" (i.e. “there” + verb to be + nominal element + that)1 and “pseudo-cleft" structures (“what" clause + verb to be)2 were wrongly identified as cleft sentences, while the structures with a postverbal subject were originally labeled as right dislocated. Furthermore, we include in the “Other" category the structures which resulted challenging to tag for the annotators and which are not “presentative there" nor pseudo-cleft constructions. “Other" structures are namely those which usually present an explicit subject in the main clause and are erroneously identified as cleft, for example La capacità di concepire un insieme di diritti è una facoltà che distingue l’uomo dagli altri esseri viventi (EN: The ability to conceive a set of rights is a faculty that distinguishes humans from other living beings). “Other" structures include also passive clauses, which were originally tagged as right dislocated because of the postverbal position of the subject. Sentences in this last category are particularly challenging both for parsers and for human annotators, since they were wrongly classified even by IPRASE experts (i.e. high-school teachers) and have been assigned the correct label only after our revision.
Table 1: Number of examples in the dataset.
Type | Sents |
Cleft sentences | 309 |
Left dislocated | 121 |
Right dislocated | 49 |
Presentative “there" | 25 |
Postverbal subject | 16 |
Pseudo-clefts | 11 |
Hanging topic | 7 |
Other | 275 |
Total | 813 |
Total (tokens) | 24,623 |
6. Parsing Evaluation
16As already mentioned in the Introduction, the lack of marked structures in treebanks used to train syntactic parsers may affect the system robustness, since structures which are not represented in the training data tend to be poorly analysed. In order to measure the impact of our novel treebank on the dependency analysis of marked structures, we compare the performance of the parser included in TINT, part of Stanford CoreNLP (Manning et al. 2014) by testing it on the new annotated sentences and training on different datasets. In particular, we first split our novel treebank into training, dev, and test, respectively 80%, 10%, and 10%, proportionally with respect to the categories listed in Table 1. When the number of examples is tiny, we include a minimum of two examples for each class in each split, therefore test and dev set contain two examples of hanging topic each, leaving the three sentences for the training set.
17We then compare two models: the original neural transition-based parser model used by TINT, which is trained using ISDT, VIT, and ParTUT (see Section 2), and the model obtained by adding to the above training data also the training set of MarkIT. We choose not to include the other Italian datasets available from Universal Dependencies (such as the ones derived from Twitter) because of their particularly informal language, which is very different from MarkIT sentences taken from students’ essays. In both cases, we use the concatenation of the development sets of the four datasets as development set during the training phase. Following the standard evaluation used in dependency parsing, we compute unlabeled attachment score (UAS) and labeled attachment score (LAS) in the two tests.
18Results in Table 2 show that on the one hand adding MarkIT to the training set improves the classification of marked structures, but on the other hand performance gain is limited. This may be due to the fact that, compared to the other treebanks (more than 23k sentences in total), the number of training instances coming from MarkIT is small (around 650 sentences). More generally, the presence of both marked and not marked sentences (the “Other" category) in the test set represents a challenge for parsers, since very similar constructions are labeled differently, see for example the presence of comma to mark right dislocated elements. Indeed, if the first model is tested only on sentences taken from ISDT+VIT, it achieves 84.47 UAS and 80.69 LAS.
Table 2: Evaluation of the dependency parsing.
Training set | UAS | LAS |
ISDT+VIT+ParTut | 82.53 | 76.62 |
ISDT+VIT+ParTut+MarkIT | 82.74 | 77.41 |
7. Release
19MarkIT is released under CC BY 4.0 license,3 and can be downloaded from Github.4 The annotation of the treebank will be soon completed with all marked sentences in the essays dataset (see Section 8) and proposed for publication on the Universal Dependencies website.5 Since the treebank is still being extended with new sentences, it may be that the content of the last version available online exceeds the size of the resource described in this paper.
8. Conclusions
20In this work we present MarkIT, a novel treebank composed of 800 sentences with syntactic annotation of marked structures. The resource covers seven types of marked sentences, plus around 200 sentences whose structure is not marked but that may be misleading both for parsers and for human annotators. The treebank is made available to the community and is meant to make dependency parsers more robust to the different syntactic structures present in Italian, in particular in the neo-standard variant. The work is still in progress, since we plan to add to the resource other sentences from the IPRASE corpus. Our goal is to include all marked sentences present in the essays, so to analyse also the distribution of the different sentence structures in this type of texts.
Bibliographie
Des DOI sont automatiquement ajoutés aux références bibliographiques par Bilbo, l’outil d’annotation bibliographique d’OpenEdition. Ces références bibliographiques peuvent être téléchargées dans les formats APA, Chicago et MLA.
Format
- APA
- Chicago
- MLA
Alessio Palmero Aprosio, and Giovanni Moretti. 2018. “Tint 2.0: An All-Inclusive Suite for Nlp in Italian.” In Proceedings of the Fifth Italian Conference on Computational Linguistics Clic-It, 10:12.
Salvi G. Benincà, P., and Frison L. 1988. “L’ordine Degli Elementi Della Frase E Le Costruzioni Marcate.” In Grande Grammatica Italiana Di Consultazione. I. La Frase. I Sintagmi Nominale E Preposizionale, edited by L. Renzi, 115–225. Il Mulino.
Gateano Berruto. 2012. Sociolinguistica Dell’italiano Contemporaneo. Carocci.
G.Berruto, and M. Cerruti. 2011. La Linguistica: Un Corso Introduttivo. UTET Università.
Cristina Bosco, Simonetta Montemagni, and Maria Simi. 2013. “Converting Italian Treebanks: Towards an Italian Stanford Dependency Treebank.” In Proceedings of the 7th Linguistic Annotation Workshop and Interoperability with Discourse, 61–69. Sofia, Bulgaria: Association for Computational Linguistics. https://aclanthology.org/W13-2308.
Cignarella, Alessandra Teresa, Cristina Bosco, and Paolo Rosso. 2019. “Presenting TWITTIRÒ-UD: An Italian Twitter Treebank in Universal Dependencies.” In Proceedings of the Fifth International Conference on Dependency Linguistics (Depling, Syntaxfest 2019), 190–97. Paris, France: Association for Computational Linguistics. https://0-doi-org.catalogue.libraries.london.ac.uk/10.18653/v1/W19-7723.
10.18653/v1/W19-7723 :Cinque, Guglielmo. 1990. Types of a’ Dependencies. MIT Press.
D’Achille, Paolo. 2003. L’italiano Contemporaneo. Il mulino Bologna.
Delmonte, R, A Bristot, and Sara Tonelli. 2007. “VIT-Venice Italian Treebank: Syntactic and Quantitative Features.” In Sixth International Workshop on Treebanks and Linguistic Theories, 1:43–54. Northern European Association for Language Technol.
Ferrari, A., and L. Zampese. 2016. Grammatica: Parole, Frasi, Testi Dell’italiano. Carocci editore.
Frasnelli, Valentino, Lorenzo Bocchi, and Alessio Palmero Aprosio. 2021. “Erase and Rewind: Manual Correction of NLP Output Through a Web Interface.” In Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing: System Demonstrations, 107–13. Online: Association for Computational Linguistics. https://0-doi-org.catalogue.libraries.london.ac.uk/10.18653/v1/2021.acl-demo.13.
10.18653/v1/2021.acl-demo.13 :Manning, Christopher D., Mihai Surdeanu, John Bauer, Jenny Finkel, Steven J. Bethard, and David McClosky. 2014. “The Stanford CoreNLP Natural Language Processing Toolkit.” In Association for Computational Linguistics (Acl) System Demonstrations, 55–60. http://www.aclweb.org/anthology/P/P14/P14-5010.
10.3115/v1/P14-5 :Marneffe, Marie-Catherine de, and Christopher D. Manning. 2008. “The Stanford Typed Dependencies Representation.” In Coling 2008: Proceedings of the Workshop on Cross-Framework and Cross-Domain Parser Evaluation, 1–8. Manchester, UK: Coling 2008 Organizing Committee. https://aclanthology.org/W08-1301.
10.3115/1608858 :Munro, Robert M. 2021. Human-in-the-Loop Machine Learning: Active Learning and Annotation for Human-Centered Ai. Simon; Schuster.
Pieri, Giulia, Dominique Brunato, and Felice Dell’Orletta. 2016. “Studio Sull’Ordine Dei Costituenti Nel Confronto Tra Generi E Complessità (Analysis of Constituents Order Across Textual Genres and Complexity).” In Proceedings of Third Italian Conference on Computational Linguistics (Clic-It 2016) & Fifth Evaluation Campaign of Natural Language Processing and Speech Tools for Italian. Final Workshop (EVALITA 2016), Napoli, Italy, December 5-7, 2016, edited by Pierpaolo Basile, Anna Corazza, Francesco Cutugno, Simonetta Montemagni, Malvina Nissim, Viviana Patti, Giovanni Semeraro, and Rachele Sprugnoli. Vol. 1749. CEUR Workshop Proceedings. CEUR-WS.org. http://ceur-ws.org/Vol-1749/paper44.pdf.
Renzi, L. 2001. “Le Tendenze Dell’italiano Contemporaneo. Note Sul Cambiamento Linguistico Nel Breve Periodo.” Studi Di Lessicografia Italiana, 279–319.
Sanguinetti, Manuela, and Cristina Bosco. 2014. “Converting the Parallel Treebank Partut in Universal Stanford Dependencies.” Converting the Parallel Treebank ParTUT in Universal Stanford Dependencies, 316–21.
10.12871/clicit2014161 :Sanguinetti, Manuela, Fabio Poletto, Cristina Bosco, Viviana Patti, and Marco Stranisci. 2018. “An Italian Twitter Corpus of Hate Speech Against Immigrants.” In Proceedings of the Eleventh International Conference on Language Resources and Evaluation (LREC 2018). Miyazaki, Japan: European Language Resources Association (ELRA). https://aclanthology.org/L18-1443.
Sprugnoli, Rachele, Tonelli Sara, Alessio Palmero Aprosio, and Moretti Giovanni. 2018. “Analysing the Evolution of Students’ Writing Skills and the Impact of Neo-Standard Italian with the Help of Computational Linguistics.” In CLiC-It 2018 Italian Conference on Computational Linguistics, 354–59. aAccademia University Press.
10.4000/books.aaccademia.2802 :Tonelli, Sara, Rachele Sprugnoli, Alessio Palmero Aprosio, Moretti Giovanni, and Menini Stefano. 2020. “Gli Strumenti Informatici. Sviluppo E Risultati.” In Come Cambia La Scrittura a Scuola: Rapporto Di Ricerca, edited by Michele Ruele and Elvira Zuin, 113–30. IPRASE.
Notes de bas de page
1 e.g. C’è Michela che ti cerca (EN: There is Michela that is looking for you)
2 e.g. Ciò che voglio davvero è che tu te ne vada (EN: What I really want is that you go)
3 https://bit.ly/cc-by-40-intl
Auteurs
Fondazione Bruno Kessler, Via Sommarive 18, Trento (Italy) – tpaccosi@fbk.eu
Fondazione Bruno Kessler, Via Sommarive 18, Trento (Italy) – aprosio@fbk.eu
Fondazione Bruno Kessler, Via Sommarive 18, Trento (Italy) – satonelli@fbk.eu
Le texte seul est utilisable sous licence Creative Commons - Attribution - Pas d'Utilisation Commerciale - Pas de Modification 4.0 International - CC BY-NC-ND 4.0. Les autres éléments (illustrations, fichiers annexes importés) sont « Tous droits réservés », sauf mention contraire.
Proceedings of the Second Italian Conference on Computational Linguistics CLiC-it 2015
3-4 December 2015, Trento
Cristina Bosco, Sara Tonelli et Fabio Massimo Zanzotto (dir.)
2015
Proceedings of the Third Italian Conference on Computational Linguistics CLiC-it 2016
5-6 December 2016, Napoli
Anna Corazza, Simonetta Montemagni et Giovanni Semeraro (dir.)
2016
EVALITA. Evaluation of NLP and Speech Tools for Italian
Proceedings of the Final Workshop 7 December 2016, Naples
Pierpaolo Basile, Franco Cutugno, Malvina Nissim et al. (dir.)
2016
Proceedings of the Fourth Italian Conference on Computational Linguistics CLiC-it 2017
11-12 December 2017, Rome
Roberto Basili, Malvina Nissim et Giorgio Satta (dir.)
2017
Proceedings of the Fifth Italian Conference on Computational Linguistics CLiC-it 2018
10-12 December 2018, Torino
Elena Cabrio, Alessandro Mazzei et Fabio Tamburini (dir.)
2018
EVALITA Evaluation of NLP and Speech Tools for Italian
Proceedings of the Final Workshop 12-13 December 2018, Naples
Tommaso Caselli, Nicole Novielli, Viviana Patti et al. (dir.)
2018
EVALITA Evaluation of NLP and Speech Tools for Italian - December 17th, 2020
Proceedings of the Seventh Evaluation Campaign of Natural Language Processing and Speech Tools for Italian Final Workshop
Valerio Basile, Danilo Croce, Maria Maro et al. (dir.)
2020
Proceedings of the Seventh Italian Conference on Computational Linguistics CLiC-it 2020
Bologna, Italy, March 1-3, 2021
Felice Dell'Orletta, Johanna Monti et Fabio Tamburini (dir.)
2020
Proceedings of the Eighth Italian Conference on Computational Linguistics CliC-it 2021
Milan, Italy, 26-28 January, 2022
Elisabetta Fersini, Marco Passarotti et Viviana Patti (dir.)
2022