Difference between revisions of "Main Page"

From TermiKnowledge
Jump to navigation Jump to search
Line 3: Line 3:
It is an educational project within the framework of the 4EU+ Alliance running in the academic year 2021–2022 for students of the University of Warsaw, Charles University in Prague, Heidelberg University and Milan University.
It is an educational project within the framework of the 4EU+ Alliance running in the academic year 2021–2022 for students of the University of Warsaw, Charles University in Prague, Heidelberg University and Milan University.


<div class="wrap" style="column-count:2">
See more:
=== About the Project ===
Our team consisted of 7 [[Instructors|teachers]] and more than 30 [[Students|student participants]] (7–8 students per university + 1 Student Assistant per university).


Our goal in the 1<sup>st</sup> semester was to compile a multilingual knowledge base on COVID-19. Following a series of six weekly lectures delivered by the teachers and spanning such topics as terminology science, corpus linguistics and lexicography, we agreed on a list of 40 key concepts and worked to provide terms in the four languages of the countries where our universities are (Czech, German, Italian and Polish) and in English. Knowledge base entries were based exclusively on text excerpted from corpora in the five languages, nearly all of which were compiled by project participants.
* [[About TermiKnowledge]]
* [[Information about the corpora|Technical parameters of the corpora]]
* [[Students|Student participants]]
* [[Instructors]]


Four corpora were created for each language, comprising four different types of texts related to COVID-19. One type was normative texts (legal regulations such as EU directives, national regulations, and medical guidelines), another was research texts (published research papers), then press texts (general press, not specialized medical journals) and finally texts found online in comments sections under press articles.  The purpose here was to take a multifaceted “snapshot” of the linguistic landscape of COVID-19. Thus, there were a total of 20 corpora altogether. Not all concepts were represented by terms in all types of corpora.
= Entry lists =


Of the 20 corpora, 19 were compiled by our participants. However, instead of compiling our own corpus of research texts in English, we used a huge corpus available on the Sketch Engine website, consisting of texts that were released as part of the COVID-19 Open Research Dataset (CORD-19).
==English entries ==
 
<div class="wrapper" style="column-count:3">
<small>''Reference: COVID-19 Open Research Dataset (CORD-19). 2020. Version 2020-05-02. Retrieved from https://pages.semanticscholar.org/coronavirus-research.&#x20;doi:10.5281/zenodo.3715505<nowiki/>.''</small>
*[[Term:Anti-vaxxer|Anti-vaxxer]]
 
*[[Term:Antibody|Antibody]]
Some technical parameters regarding our corpora can be found [[Information about the corpora|here]].
*[[Term:Asymptomatic|Asymptomatic]]
 
* [[Term:Big pharma|Big pharma]]
=== Entry Structure ===
*[[Term:Booster shot|Booster shot]]
Each entry in a given language combines information from up to four corpora. Therefore, entries are generally subdivided into four sections (Normative, Press, Research & Comments) corresponding to the different corpora. If a corpus did not contain any single- or multiword terms or expressions (headword, variants and synonyms) reflecting a given concept, a note so stating can be seen, and the usual field headings are not present.
* [[Term:COVID-19|COVID-19]]
 
*[[Term:COVID-19 pandemic|COVID-19 pandemic]]
While most headwords are terms, i.e. units of language naming concepts that are part of the conceptual system of a given field of specialized knowledge or activity, some (such as ''anti-vaxxer'') are obviously non-terminological expressions used in general language<ins>.</ins>
*
 
[[Term:Close contact|Close contact]]
Below the headword there is a list of related terms with a specification of the relation in each case. The specifications are written in natural language. We did not use a semiotic code.
*[[Term:Coronavirus vaccine|Coronavirus vaccine]]
 
*[[Term:Covidiot|Covidiot]]
The fields within each section relate to conceptual and linguistic aspects of the headword.
*[[Term:Digital COVID certificate|Digital COVID certificate]]
 
*[[Term:Dose|Dose]]
The conceptual domain is reflected in definitions (which may include encyclopaedic information) and descriptions. Definitions, with encyclopaedic information added where possible, are given in the Normative section. It was assumed that any description of the entry term following the rules prescribed for intensional (hypernym + distinctive features) or extensional (list of types of objects covered) definitions would represent good material for inclusion in the Definition field while all other information describing a term would be considered encyclopaedic information. The Research section includes a Definition/Description field, because we expected that it would be difficult to find formal definitions in research papers, but that scientific descriptions of concepts could be excerpted from those corpora. The Press and Comments sections include just a Description field, where entry authors would place any definition-like quotations. You will notice that the conceptual fields are sometimes skipped, which is because entry authors could not find suitable quotations to use as a definition or description. This may be partly due to the rather limited size of a number of our corpora, but also partly stem from the nature of a particular entry headword.
*[[Term:Face mask|Face mask]]
 
*[[Term:Hospitalization|Hospitalization]]
All definitions, descriptions and pieces of encyclopaedic information are direct quotations from texts in the respective corpora. The quotations were not altered in any way. We also did not prepare our own definitions that summarized what we had read. Thus, our approach to definition mining was exclusively corpus-driven.
* [[Term:Infection rate|Infection rate]]
 
*[[Term:Intensive care|Intensive care]]
Some of our headwords were obviously not terms, for example emotion-laden units like ''anti-vaxxer'' or ''covidiot'', but this was not an obstacle to finding quotations that described their meaning. By way of experiment, we also included ''negative'' and ''positive'' as headwords, even though these adjectives are classified as term elements rather than standalone terms. The compilers noted that these two behaved differently in texts and not all uses were related to COVID-19, but this proved no obstacle to the successful generation of entries.
*[[Term:Lockdown|Lockdown]]
 
* [[Term:MRNA vaccine|mRNA vaccine]]
The linguistic fields begin with Variants and Synonyms, which are found above the Definition/Description field. We decided not to distinguish between (spelling, morphological, etc.) variants and true synonyms.
*[[Term:Mechanical ventilation|Mechanical ventilation]]
 
*[[Term:Negative|Negative]]
All synonyms and variants can be your starting point for accessing entries through a redirection mechanism.
*[[Term:Positive|Positive]]
 
*[[Term:Proof of vaccination|Proof of vaccination]]
The remaining fields providing linguistic information comprise Examples, followed by Collocations and an indication of the Keyness (see below) of the headword and any variants or synonyms attested in a given corpus.
*[[Term:Quarantine|Quarantine]]
 
*[[Term:RT-PCR test|RT-PCR test]]
Sentences that served as Examples were selected from among those that did not quite qualify as definitions/descriptions, for example, because they did not contain general information. The presence of frequent collocations also played a role.
*[[Term:Rapid diagnostic test|Rapid diagnostic test]]
 
*[[Term:Restrictions|Restrictions]]
Collocations were included on the basis of their frequency. Although steps were taken to ensure a unified layout, particularly of collocations with verbs to distinguish subject and object collocations, some liberty was allowed in that respect, too.
*[[Term:SARS-CoV-2|SARS-CoV-2]]
 
*[[Term:Severe course of COVID-19|Severe course of COVID-19]]
==== The Keyness Field ====
*[[Term:Social distancing|Social distancing]]
The values in the Keyness field refer to the position of a particular term or expression on the keyness list for a particular corpus. Note that Sketch Engine generates two keyword lists, separately for single words and potential multiword expressions, and thus there were two keyness lists for each corpus. That is why the keyness figure given for a single-word keyword may be higher (which generally means that its frequency was lower) than the figure for a multiword expression including this headword, since the keyness data were derived from a different list.
*[[Term:Symptomatic|Symptomatic]]
 
* [[Term:Vaccinate|Vaccinate]]
The reference corpora were the default national web-crawl corpora suggested by Sketch Engine.
*[[Term:Vaccination programme|Vaccination programme]]
 
*[[Term:Vaccination rate|Vaccination rate]]
We encountered a number of problems with keyness lists. In the case of highly inflected languages (Czech and Polish), some lexemes were not lemmatized by Sketch Engine, and so the occurrences of the different paradigm forms (cases, tense-person-number forms, etc.) were counted separately, the result being, obviously, that the associated keyness figures were rather low (high positions on keyness lists). Keyness positions are nevertheless provided in view of their potential utility for making comparisons between synonyms, terms or languages.</div>
*[[Term:Vaccine mandate|Vaccine mandate]]
*[[Term:Variant of SARS-CoV-2|Variant of SARS-CoV-2]]
*[[Term:Ventilator|Ventilator]]
*[[Term:Virus transmission|Virus transmission]]
*[[Term:Wave of pandemic|Wave of pandemic]]</div>
==Czech entries ==
<div class="wrapper" style="column-count:3">*[[Term:Rapid diagnostic test/cz|Antigenní test]]
*[[Term:Anti-vaxxer/cz|Anti-vaxxer/cz]]
*[[Term:Asymptomatic/cz|Asymptomatický]]
*[[Term:Close contact/cz|Blízký kontakt]]
*[[Term:COVID-19/cz|COVID-19]]
*[[Term:Covidiot/cz|Covidiot]]
*[[Term:Dose/cz|Dávka]]
*[[Term:Digital COVID certificate/cz|Digitální certifikát EU COVID]]
*[[Term:Hospitalization/cz|Hospitalizace]]
*[[Term:Intensive care/cz|Jednotka intenzivní péče]]
*[[Term:Quarantine/cz|Karanténa]]
*[[Term:Lockdown/cz|Lockdown]]
*[[Term:Mechanical ventilation/cz|Mechanická ventilace]]
*[[Term:MRNA vaccine/cz|mRNA vakcína]]
*[[Term:Proof of vaccination/cz|Národní certifikát o provedeném očkování]]
*[[Term:Negative/cz|Negativní]]
*[[Term:Vaccination programme/cz|Očkovací schéma]]
*[[Term:Vaccinate/cz|Očkovat]]
*[[Term:Social distancing/cz|Odstup]]
*[[Term:Restrictions/cz|Opatření]]
*[[Term:Positive/cz|Pozitivní]]
*[[Term:Antibody/cz|Protilátka]]
*[[Term:COVID-19 pandemic/cz|Pandemie COVID-19]]
*[[Term:Booster shot/cz|Posilovací dávka]]
*[[Term:Vaccine mandate/cz|Povinné očkování]]
*[[Term:Vaccination rate/cz|Proočkovanost]]
*[[Term:Face mask/cz|Respirátor]]
*[[Term:RT-PCR test/cz|RT-PCR test]]
*[[Term:SARS-CoV-2/cz|SARS-CoV-2]]
* [[Term:Infection rate/cz|Šíření nákazy]]
*[[Term:Virus transmission/cz|Šíření viru]]
*[[Term:Infection rate/cz|Symptomatický]]
* [[Term:Severe course of COVID-19/cz|Těžký průběh COVID-19]]
* [[Term:Coronavirus vaccine/cz|Vakcína proti COVID-19]]
*[[Term:Variant of SARS-CoV-2/cz|Varianta SARS-CoV-2]]
*[[Term:Ventilator/cz|Ventilátor]]
*[[Term:Wave of pandemic/cz|Vlna]]</div>
==Deutsche Artikel==
<div class="wrapper" style="column-count:3">
*[[Term:Antibody/de|Antikörper]]
*[[Term:Asymptomatic/de|Asymptomatisch]]
*[[Term:Booster shot/de|Auffrischimpfung]]
*[[Term:Virus transmission/de|Beatmungsgerät]]
*[[Term:Restrictions/de|Beschränkungen]]
*[[Term:Big pharma/de|Big Pharma]]
*[[Term:COVID-19/de|Covid-19]]
*[[Term:Coronavirus vaccine/de|Covid-19-Impfung]]
*[[Term:COVID-19 pandemic/de|Covid-19-Pandemie]]
*[[Term:Wave of pandemic/de|Covid-19-Welle]]
* [[Term:Digital COVID certificate/de|Digitales COVID-Zertifikat]]
*[[Term:Dose/de|Dosis]]
*[[Term:Close contact/de|Enger Kontakt]]
*[[Term:Hospitalization/de|Hospitalisierung]]
*[[Term:Vaccinate/de|Impfen]]
*[[Term:Anti-vaxxer/de|Impfgegner]]
*[[Term:Vaccination programme/de|Impfkampagne]]
* [[Term:Proof of vaccination/de|Impfnachweis]]
*[[Term:Vaccine mandate/de|Impfpflicht]]
*[[Term:Vaccination rate/de|Impfquote]]
* [[Term:Infection rate/de|Infektionsrate]]
*[[Term:Intensive care/de|Intensivmedizinische Behandlung]]
*[[Term:Lockdown/de|Lockdown]]
*[[Term:Mechanical ventilation/de|Mechanische Beatmung]]
* [[Term:Face mask/de|Medizinische Gesichtsmaske]]
*[[Term:MRNA vaccine/de|mRNA-Impfstoff]]
*[[Term:Negative/de|Negativ]]
*[[Term:RT-PCR test/de|PCR-Test]]
*[[Term:Positive/de|Positiv]]
*[[Term:Quarantine/de|Quarantäne]]
* [[Term:SARS-CoV-2/de|SARS-CoV-2]]
*[[Term:Variant of SARS-CoV-2/de|SARS-CoV-2-Virusvariante]]
*[[Term:Rapid diagnostic test/de|Schnelltest]]
* [[Term:Severe course of COVID-19/de|Schwerer Verlauf von Covid-19]]
*[[Term:Social distancing/de|Social Distancing]]
*[[Term:Symptomatic/de|Symptomatisch]]
*[[Term:Ventilator/de|Ventilator/de]]
* [[Term:Virus transmission/de|Virusübertragung]]</div>
==Italian entries==
<div class="wrapper" style="column-count:3">
*[[Term:Antibody/it|Anticorpo]]
*[[Term:Asymptomatic/it|Asintomatico]]
*[[Term:Vaccination programme/it|Campagna di vaccinazione]]
*[[Term:Big pharma/it|Casa farmaceutica]]
*[[Term:Proof of vaccination/it|Certificato di vaccinazione]]
*[[Term:Digital COVID certificate/it|Certificazione verde]]
*[[Term:Close contact/it|Contatti stretti]]
* [[Term:COVID-19/it|COVID-19]]
* [[Term:Social distancing/it|Distanziamento interpersonale]]
*[[Term:Dose/it|Dose]]
*[[Term:Booster shot/it|Dose di richiamo]]
*[[Term:Severe course of COVID-19/it|Forma grave di Covid-19]]
*[[Term:Lockdown/it|Lockdown]]
* [[Term:Face mask/it|Mascherina]]
* [[Term:Negative/it|Negativo]]
*[[Term:Anti-vaxxer/it|No-vax]]
*[[Term:Vaccine mandate/it|Obbligo vaccinale]]
* [[Term:Wave of pandemic/it|Ondata]]
*[[Term:Hospitalization/it|Ospedalizzazione]]
*[[Term:COVID-19 pandemic/it|Pandemia di COVID-19]]
*[[Term:Positive/it|Positivo]]
*[[Term:Restrictions/it|Restrizioni]]
*[[Term:Quarantine/it|Quarantena]]
*[[Term:SARS-CoV-2/it|SARS-CoV-2]]
*[[Term:Symptomatic/it|Sintomatico]]
*[[Term:Vaccination rate/it|Tasso di copertura vaccinale]]
*[[Term:Infection rate/it|Tasso di infezione]]
*[[Term:Intensive care/it|Terapia intensiva]]
*[[Term:Rapid diagnostic test/it|Test antigenico]]
*[[Term:RT-PCR test/it|Test molecolare]]
*[[Term:Virus transmission/it|Trasmissione]]
*[[Term:Vaccinate/it|Vaccinare]]
* [[Term:Coronavirus vaccine/it|Vaccino]]
* [[Term:MRNA vaccine/it|Vaccino a mRNA]]
*[[Term:Variant of SARS-CoV-2/it|Varianti di SARS-CoV-2]]
*[[Term:Ventilator/it|Ventilatore]]
* [[Term:Mechanical ventilation/it|Ventilazione meccanica]]</div>
==Hasła polskie==
<div class="wrapper" style="column-count:3">
*[[Term:Anti-vaxxer/pl|Antyszczepionkowiec]]
*[[Term:Asymptomatic/pl|Bezobjawowy]]
*[[Term:Big pharma/pl|Big pharma]]
*[[Term:Close contact/pl|Bliski kontakt]]
*[[Term:Severe course of COVID-19/pl|Ciężki przebieg COVID-19]]
*[[Term:COVID-19/pl|COVID-19]]
*[[Term:Digital COVID certificate/pl|Cyfrowe zaświadczenie COVID-19]]
*[[Term:Dose/pl|Dawka]]
*[[Term:Positive/pl|Dodatni]]
*[[Term:Social distancing/pl|Dystans społeczny]]
*[[Term:Wave of pandemic/pl|Fala pandemii]]
*[[Term:Hospitalization/pl|Hospitalizacja]]
* [[Term:Intensive care/pl|Intensywna opieka medyczna]]
*[[Term:Covidiot/pl|Kowidiota]]
*[[Term:Quarantine/pl|Kwarantanna]]
*[[Term:Lockdown/pl|Lockdown]]
* [[Term:Face mask/pl|Maseczka]]
*[[Term:Mechanical ventilation/pl|Mechaniczna wentylacja]]
*[[Term:Symptomatic/pl|Objawowy]]
*[[Term:Restrictions/pl|Obostrzenia]]
*[[Term:Vaccine mandate/pl|Obowiązek szczepień]]
*[[Term:COVID-19 pandemic/pl|Pandemia COVID-19]]
*[[Term:Vaccination rate/pl|Poziom wyszczepienia]]
*[[Term:Vaccination programme/pl|Program szczepień]]
*[[Term:Antibody/pl|Przeciwciało]]
*[[Term:Ventilator/pl|Respirator]]
*[[Term:SARS-CoV-2/pl|SARS-CoV-2]]
*[[Term:Vaccinate/pl|Szczepić]]
*[[Term:MRNA vaccine/pl|Szczepionka mRNA]]
*[[Term:Coronavirus vaccine/pl|Szczepionka przeciwko SARS-CoV-2]]
*[[Term:Rapid diagnostic test/pl|Szybki test]]
*[[Term:RT-PCR test/pl|Test RT-PCR]]
*[[Term:Virus transmission/pl|Transmisja wirusa]]
*[[Term:Booster shot/pl|Trzecia dawka]]
*[[Term:Negative/pl|Ujemny]]
*[[Term:Variant of SARS-CoV-2/pl|Wariant SARS-CoV-2]]
*[[Term:Infection rate/pl|Wskaźnik zakażeń]]
*[[Term:Proof of vaccination/pl|Zaświadczenie o szczepieniu]]</div>

Revision as of 17:34, 7 June 2022

Welcome to TermiKnowledge!

It is an educational project within the framework of the 4EU+ Alliance running in the academic year 2021–2022 for students of the University of Warsaw, Charles University in Prague, Heidelberg University and Milan University.

See more:

Entry lists

English entries

Czech entries

Deutsche Artikel

Italian entries

Hasła polskie