DFG-Project: Actionality classes and cross-linguistic coding tendencies. Typological research and development of an analysis software tool

This project will document and explain a grammatical universal, i.e. cross-linguistic coding tendencies in verbs with actional classes and aspect classes by demonstrating a link between cross-linguistic patterns of language form and general trends of language use. Essential components of the analysis of this universal will be carried out by an analysis software tool which will be developed during this project.
The claim of the proposal is that frequently expressed meanings tend to be expressed by zero-coded forms (form-frequency correspondences). The quantities “frequency” and “form” are components of Zipf’s law (1935) which will be pursued by extending the frequency-form relation to a frequency-form/function-relation, since the coding asymmetries express distinctions between actional and aspect classes of verbs and have semantic implications: With zero-coding, telic verbs tend to express perfective aspect and atelic verbs imperfective aspect, respectively, with overt-coding it is the other round. The form (coding)-frequency-correspondence will be examined inspecting the asymmetry in the coding of atelic/telic verbs in a number of European and non-European languages. In general, combinations that occur more frequently tend to be zero-coded overtly across languages, while the combinations that occur more rarely tend to be coded overtly. The proposed explanation is that higher-frequency and thus higher probability items are more predictable than lower-frequency items, and predictable content need not be expressed overtly or can be expressed by shorter forms. The hypothesis is that frequency is just one of a set of factors that constitute predictability, and it is aimed at disclosing and weighting these factors. Form-frequency correspondences make language structure more efficient (Zipf 1949), but it still needs to be shown that there exists a mechanism that creates and maintains these efficient structures: recurrent instances of language change driven by the speakers’ preference for user-friendly utterances. The project thus combines cross-linguistic research on grammar and cross-linguistic corpus research. Form-frequency correspondences are still largely overlooked and ignored by linguists, so the current project will have a significant impact on our general understanding of human language.
The technological output of this project will be analysis software tool. In order to disclose components of “predictability”, the tool provides a couple of statistical entropy based and probability based techniques. This part of the project comprises not only software programming but also evaluation and interpretation of the results and contributes essentially to theory forming. The results will allow for deeper insights what we mean when talking about predictably of coding asymmetries.
Future research will deal with the project’s topics by means of formalisms such as stochastic optimality theory (Bresnan et al. 2001) or evolutionary game theory (Jäger 2007).

News

  • Max Koelbl posted on November 4, 2019

    At IntelliSys 2019 in London (5.-6. September), our paper Estimation of Average Information Content: Comparison of impact of contexts. Intelligent Systems and Applications was presented.

  • Max Koelbl posted on June 20, 2019

    At BIS 2019, our paper Interaction of Information Content and Frequency as Predictors of Verbs' Lengths will be presented

  • Max Koelbl posted on May 22, 2019

    Talk Surprisal in Texten: Der Shannonsche Informationsgehalt als Merkmal für Text-Zusammenfassungen on 22 May 2019 at Hochschule Anhalt.

  • Max Koelbl posted on March 4, 2019

    Past events

    January 2018 (25)

    January 2018, 25th (Thursday), 9:00-12:30h: Informal mini-workshop on tense-aspect typology and coding asymmetries, with presentations by Martin Haspelmath, Olav Mueller-Reichau, Natalia Levshina, Michael Richter

  • Max Koelbl posted on November 9, 2018

    The list of abstracts of the workshop are available now.

  • Dr. Michael Richter posted on October 18, 2018

    Leipzig workshop on "Language Universals, typology and corpus-based Research", date: 14th. of November; time: 10 am - 6pm; venue: Mediencampus in Gohlis Mediencampus

  • Max Koelbl posted on September 1, 2018

    At KONVENS, the paper Aspect coding asymmetries of verbs: The case of Russian will be presented.

The Team

Prof. Dr. Gerhard Heyer

Prof. Dr. Gerhard Heyer

See profile
Dr. Michael Richter

Dr. Michael Richter

See profile
Yuki Kyogoku

Yuki Kyogoku

See profile
Tariq Yousef

Tariq Yousef

See profile
Max Koelbl

Max Koelbl

Publications

  • Kölbl, M.; Kyogoku, Y.; Philipp, J.; Richter, M.; Rietdorf, C. and Yousef, T. (2020). Keyword Extraction in German: Information-theory vs. Deep Learning.In Proceedings of the 12th International Conference on Agents and Artificial Intelligence - Volume 1: NLPinAI, ISBN 978-989-758-395-7, ISSN 2184-433X, pages 459-464. DOI: 10.5220/0009374704590464
  • Michael Richter, Giuseppe Celano. Aspectual coding asymmetries: predicting aspectual verb lengths by the effects frequency and information content. To appear in Topics in Linguistics.
  • Michael Richter, Tariq Yousef. 2019. Predicting default and non-default aspectual coding: Impact and density of information features. Preliminary proceedings of the 15th Conference on Natural Language Processing (KONVENS 2019): Kaleidoscope Abstracts. 275 – 277.
  • Richter, M. Kyogoku, Y., Kölbl, M., Interaction of Information Content and Frequency as Predictors of Verbs’ Lengths. In Witold Abramowicz, Rafael Corchuelo, Business Information Systems 22nd International Conference, BIS 2019, Seville, Spain, June 26–28, 2019, Proceedings, Lecture Notes in Business Information Processing 353. Springer, 271 – 282. (https://doi.org/10.1007/978-3-030-20485-3).
  • Richter, M., Kyogoku, Y., Kölbl, M., “Estimation of Average Information Content: Comparison of impact of contexts". Intelligent Systems and Applications - Proceedings of the 2019 Intelligent Systems Conference, IntelliSys 2019, London, UK, September 5-6, 2019, Volume 2. Advance in Intelligent Systems and Computing 1038. Springer 2020, 1251 – 1257. (https://doi.org/10.1007/978-3-030-29513-4_91).
  • Celano, G., Richter, M, Voll, R., Heyer, H. (2018). Aspect coding asymmetries of verbs: The case of Russian. In Barbaresi, A., Biber, H., Neubarth, F., Osswald, R. (eds.), KONVENS 2018. PROCEEDINGS of the 14th Conference on Natural Language Processing, 34 – 39.

Papers