Difference between revisions of "Main Page"

From TermiKnowledge
Jump to navigation Jump to search
 
(15 intermediate revisions by the same user not shown)
Line 1: Line 1:
== Welcome to TermiKnowledge! ==
'''<big>Welcome to TermiKnowledge!</big>'''
It is an educational project within the framework of the 4EU+ Alliance running in the academic year 2021–2022 for students of the University of Warsaw, Charles University in Prague, Heidelberg University and Milan University.


<div class="wrap" style="column-count:2">=== About the Project ===
This is a multilingual terminological knowledge base about COVID-19 developed as an educational project within the framework of the 4EU+ Alliance, co-funded by the Erasmus+ Programme of the European Union.
Our team consisted of 7 teachers [LINK TO TEACHER BIO PAGE] and more than 30 student participants (7–8 students per university + 1 Student Assistant per university). Their names, degree programmes and roles in the project can be found here [LINK TO STUDENT BIO PAGE]


Our goal in the 1<sup>st</sup> semester was to compile a multilingual knowledge base on COVID-19. Following a series of six weekly lectures delivered by the teachers and spanning such topics as terminology science, corpus linguistics and lexicography, we agreed on a list of 40 key concepts and worked to provide terms in the four languages of the countries where our universities are (Czech, German, Italian and Polish) and in English. Knowledge base entries were based exclusively on text excerpted from corpora in the five languages, nearly all of which were compiled by project participants.  
The knowledge base covers almost 40 parallel terminological entries in five languages: English, Czech, German, Italian, and Polish. They were compiled on the basis of parallel corpora of various text genres. You can choose an entry from one of the lists below or search for it using the search box.


Four corpora were created for each language, comprising four different types of texts related to COVID-19. One type was normative texts (legal regulations such as EU directives, national regulations, and medical guidelines), another was research texts (published research papers), then press texts (general press, not specialized medical journals) and finally texts found online in comments sections under press articles.  The purpose here was to take a multifaceted “snapshot” of the linguistic landscape of COVID-19. Thus, there were a total of 20 corpora altogether. Not all concepts were represented by terms in all types of corpora.
'''See more:'''


Of the 20 corpora, 19 were compiled by our participants. However, instead of compiling our own corpus of research texts in English, we used a huge corpus available on the Sketch Engine website, consisting of texts that were released as part of the COVID-19 Open Research Dataset (CORD-19). <small>''Reference: COVID-19 Open Research Dataset (CORD-19). 2020. Version 2020-05-02. Retrieved from https://pages.semanticscholar.org/coronavirus-research.&#x20;doi:10.5281/zenodo.3715505<nowiki/>.''</small>
* [[About TermiKnowledge]]
* [[Information about the corpora|Technical parameters of the corpora]]
* [[Students|Student participants]]
* [[Instructors]]
*[[Publications]]
</br>


Some technical parameters regarding our corpora can be found here [TECHNICAL PAGE LINK].
=== Entry lists ===


=== Entry Structure ===
Each entry in a given language combines information from up to four corpora. Therefore, entries are generally subdivided into four sections (Normative, Press, Research & Comments) corresponding to the different corpora. If a corpus did not contain any single- or multiword terms or expressions (headword, variants and synonyms) reflecting a given concept, a note so stating can be seen, and the usual field headings are not present.


While most headwords are terms, i.e. units of language naming concepts that are part of the conceptual system of a given field of specialized knowledge or activity, some (such as ''anti-vaxxer'') are obviously non-terminological expressions used in general language<ins>.</ins>
==English entries==
<div class="wrapper" style="column-count:3">
*[[Term:Anti-vaxxer|Anti-vaxxer]]
*[[Term:Antibody|Antibody]]
*[[Term:Asymptomatic|Asymptomatic]]
*[[Term:Big pharma|Big pharma]]
*[[Term:Booster shot|Booster shot]]
*[[Term:COVID-19|COVID-19]]
*[[Term:COVID-19 pandemic|COVID-19 pandemic]]
*


Below the headword there is a list of related terms with a specification of the relation in each case. The specifications are written in natural language. We did not use a semiotic code.
*[[Term:Close contact|Close contact]]
 
*[[Term:Coronavirus vaccine|Coronavirus vaccine]]
The fields within each section relate to conceptual and linguistic aspects of the headword.
*[[Term:Covidiot|Covidiot]]
 
*[[Term:Digital COVID certificate|Digital COVID certificate]]
The conceptual domain is reflected in definitions (which may include encyclopaedic information) and descriptions. Definitions, with encyclopaedic information added where possible, are given in the Normative section. It was assumed that any description of the entry term following the rules prescribed for intensional (hypernym + distinctive features) or extensional (list of types of objects covered) definitions would represent good material for inclusion in the Definition field while all other information describing a term would be considered encyclopaedic information. The Research section includes a Definition/Description field, because we expected that it would be difficult to find formal definitions in research papers, but that scientific descriptions of concepts could be excerpted from those corpora. The Press and Comments sections include just a Description field, where entry authors would place any definition-like quotations. You will notice that the conceptual fields are sometimes skipped, which is because entry authors could not find suitable quotations to use as a definition or description. This may be partly due to the rather limited size of a number of our corpora, but also partly stem from the nature of a particular entry headword.
*[[Term:Dose|Dose]]
 
*[[Term:Face mask|Face mask]]
All definitions, descriptions and pieces of encyclopaedic information are direct quotations from texts in the respective corpora. The quotations were not altered in any way. We also did not prepare our own definitions that summarized what we had read. Thus, our approach to definition mining was exclusively corpus-driven.
*[[Term:Hospitalization|Hospitalization]]
 
*[[Term:Infection rate|Infection rate]]
Some of our headwords were obviously not terms, for example emotion-laden units like ''anti-vaxxer'' or ''covidiot'', but this was not an obstacle to finding quotations that described their meaning. By way of experiment, we also included ''negative'' and ''positive'' as headwords, even though these adjectives are classified as term elements rather than standalone terms. The compilers noted that these two behaved differently in texts and not all uses were related to COVID-19, but this proved no obstacle to the successful generation of entries.
*[[Term:Intensive care|Intensive care]]
 
*[[Term:Lockdown|Lockdown]]
The linguistic fields begin with Variants and Synonyms, which are found above the Definition/Description field. We decided not to distinguish between (spelling, morphological, etc.) variants and true synonyms.
*[[Term:MRNA vaccine|mRNA vaccine]]
 
*[[Term:Mechanical ventilation|Mechanical ventilation]]
All synonyms and variants can be your starting point for accessing entries through a redirection mechanism.
*[[Term:Negative|Negative]]
 
*[[Term:Positive|Positive]]
The remaining fields providing linguistic information comprise Examples, followed by Collocations and an indication of the Keyness (see below) of the headword and any variants or synonyms attested in a given corpus.
*[[Term:Proof of vaccination|Proof of vaccination]]
 
*[[Term:Quarantine|Quarantine]]
Sentences that served as Examples were selected from among those that did not quite qualify as definitions/descriptions, for example, because they did not contain general information. The presence of frequent collocations also played a role.
*[[Term:RT-PCR test|RT-PCR test]]
 
*[[Term:Rapid diagnostic test|Rapid diagnostic test]]
Collocations were included on the basis of their frequency. Although steps were taken to ensure a unified layout, particularly of collocations with verbs to distinguish subject and object collocations, some liberty was allowed in that respect, too.
*[[Term:Restrictions|Restrictions]]
 
*[[Term:SARS-CoV-2|SARS-CoV-2]]
==== The Keyness Field ====
*[[Term:Severe course of COVID-19|Severe course of COVID-19]]
The values in the Keyness field refer to the position of a particular term or expression on the keyness list for a particular corpus. Note that Sketch Engine generates two keyword lists, separately for single words and potential multiword expressions, and thus there were two keyness lists for each corpus. That is why the keyness figure given for a single-word keyword may be higher (which generally means that its frequency was lower) than the figure for a multiword expression including this headword, since the keyness data were derived from a different list.
*[[Term:Social distancing|Social distancing]]
 
*[[Term:Symptomatic|Symptomatic]]
The reference corpora were the default national web-crawl corpora suggested by Sketch Engine.
*[[Term:Vaccinate|Vaccinate]]
 
*[[Term:Vaccination programme|Vaccination programme]]
We encountered a number of problems with keyness lists. In the case of highly inflected languages (Czech and Polish), some lexemes were not lemmatized by Sketch Engine, and so the occurrences of the different paradigm forms (cases, tense-person-number forms, etc.) were counted separately, the result being, obviously, that the associated keyness figures were rather low (high positions on keyness lists). Keyness positions are nevertheless provided in view of their potential utility for making comparisons between synonyms, terms or languages.</div>
*[[Term:Vaccination rate|Vaccination rate]]
* [[Term:Vaccine mandate|Vaccine mandate]]
*[[Term:Variant of SARS-CoV-2|Variant of SARS-CoV-2]]
*[[Term:Ventilator|Ventilator]]
* [[Term:Virus transmission|Virus transmission]]
*[[Term:Wave of pandemic|Wave of pandemic]]</div>
==Czech entries==
<div class="wrapper" style="column-count:3">
*[[Term:Rapid diagnostic test/cz|Antigenní test]]
*[[Term:Anti-vaxxer/cz|Anti-vaxxer/cz]]
*[[Term:Asymptomatic/cz|Asymptomatický]]
*[[Term:Close contact/cz|Blízký kontakt]]
*[[Term:COVID-19/cz|COVID-19]]
*[[Term:Covidiot/cz|Covidiot]]
*[[Term:Dose/cz|Dávka]]
*[[Term:Digital COVID certificate/cz|Digitální certifikát EU COVID]]
*[[Term:Hospitalization/cz|Hospitalizace]]
*[[Term:Intensive care/cz|Jednotka intenzivní péče]]
*[[Term:Quarantine/cz|Karanténa]]
*[[Term:Lockdown/cz|Lockdown]]
*[[Term:Mechanical ventilation/cz|Mechanická ventilace]]
*[[Term:MRNA vaccine/cz|mRNA vakcína]]
*[[Term:Proof of vaccination/cz|Národní certifikát o provedeném očkování]]
*[[Term:Negative/cz|Negativní]]
*[[Term:Vaccination programme/cz|Očkovací schéma]]
*[[Term:Vaccinate/cz|Očkovat]]
*[[Term:Social distancing/cz|Odstup]]
*[[Term:Restrictions/cz|Opatření]]
*[[Term:Positive/cz|Pozitivní]]
*[[Term:Antibody/cz|Protilátka]]
*[[Term:COVID-19 pandemic/cz|Pandemie COVID-19]]
*[[Term:Booster shot/cz|Posilovací dávka]]
*[[Term:Vaccine mandate/cz|Povinné očkování]]
*[[Term:Vaccination rate/cz|Proočkovanost]]
*[[Term:Face mask/cz|Respirátor]]
*[[Term:RT-PCR test/cz|RT-PCR test]]
*[[Term:SARS-CoV-2/cz|SARS-CoV-2]]
*[[Term:Infection rate/cz|Šíření nákazy]]
*[[Term:Virus transmission/cz|Šíření viru]]
*[[Term:Infection rate/cz|Symptomatický]]
*[[Term:Severe course of COVID-19/cz|Těžký průběh COVID-19]]
*[[Term:Coronavirus vaccine/cz|Vakcína proti COVID-19]]
*[[Term:Variant of SARS-CoV-2/cz|Varianta SARS-CoV-2]]
*[[Term:Ventilator/cz|Ventilátor]]
*[[Term:Wave of pandemic/cz|Vlna]]</div>
==Deutsche Artikel==
<div class="wrapper" style="column-count:3">
*[[Term:Antibody/de|Antikörper]]
*[[Term:Asymptomatic/de|Asymptomatisch]]
*[[Term:Booster shot/de|Auffrischimpfung]]
*[[Term:Virus transmission/de|Beatmungsgerät]]
*[[Term:Restrictions/de|Beschränkungen]]
*[[Term:Big pharma/de|Big Pharma]]
*[[Term:COVID-19/de|Covid-19]]
*[[Term:Coronavirus vaccine/de|Covid-19-Impfung]]
*[[Term:COVID-19 pandemic/de|Covid-19-Pandemie]]
*[[Term:Wave of pandemic/de|Covid-19-Welle]]
*[[Term:Digital COVID certificate/de|Digitales COVID-Zertifikat]]
*[[Term:Dose/de|Dosis]]
*[[Term:Close contact/de|Enger Kontakt]]
*[[Term:Hospitalization/de|Hospitalisierung]]
*[[Term:Vaccinate/de|Impfen]]
*[[Term:Anti-vaxxer/de|Impfgegner]]
*[[Term:Vaccination programme/de|Impfkampagne]]
*[[Term:Proof of vaccination/de|Impfnachweis]]
*[[Term:Vaccine mandate/de|Impfpflicht]]
*[[Term:Vaccination rate/de|Impfquote]]
*[[Term:Infection rate/de|Infektionsrate]]
*[[Term:Intensive care/de|Intensivmedizinische Behandlung]]
*[[Term:Lockdown/de|Lockdown]]
*[[Term:Mechanical ventilation/de|Mechanische Beatmung]]
*[[Term:Face mask/de|Medizinische Gesichtsmaske]]
*[[Term:MRNA vaccine/de|mRNA-Impfstoff]]
*[[Term:Negative/de|Negativ]]
*[[Term:RT-PCR test/de|PCR-Test]]
*[[Term:Positive/de|Positiv]]
*[[Term:Quarantine/de|Quarantäne]]
*[[Term:SARS-CoV-2/de|SARS-CoV-2]]
*[[Term:Variant of SARS-CoV-2/de|SARS-CoV-2-Virusvariante]]
*[[Term:Rapid diagnostic test/de|Schnelltest]]
*[[Term:Severe course of COVID-19/de|Schwerer Verlauf von Covid-19]]
*[[Term:Social distancing/de|Social Distancing]]
*[[Term:Symptomatic/de|Symptomatisch]]
*[[Term:Ventilator/de|Ventilator/de]]
*[[Term:Virus transmission/de|Virusübertragung]] </div>
==Italian entries==
<div class="wrapper" style="column-count:3">
*[[Term:Antibody/it|Anticorpo]]
*[[Term:Asymptomatic/it|Asintomatico]]
*[[Term:Vaccination programme/it|Campagna di vaccinazione]]
*[[Term:Big pharma/it|Casa farmaceutica]]
*[[Term:Proof of vaccination/it|Certificato di vaccinazione]]
*[[Term:Digital COVID certificate/it|Certificazione verde]]
*[[Term:Close contact/it|Contatti stretti]]
*[[Term:COVID-19/it|COVID-19]]
*[[Term:Social distancing/it|Distanziamento interpersonale]]
*[[Term:Dose/it|Dose]]
*[[Term:Booster shot/it|Dose di richiamo]]
*[[Term:Severe course of COVID-19/it|Forma grave di Covid-19]]
*[[Term:Lockdown/it|Lockdown]]
*[[Term:Face mask/it|Mascherina]]
*[[Term:Negative/it|Negativo]]
*[[Term:Anti-vaxxer/it|No-vax]]
*[[Term:Vaccine mandate/it|Obbligo vaccinale]]
*[[Term:Wave of pandemic/it|Ondata]]
*[[Term:Hospitalization/it|Ospedalizzazione]]
*[[Term:COVID-19 pandemic/it|Pandemia di COVID-19]]
*[[Term:Positive/it|Positivo]]
*[[Term:Restrictions/it|Restrizioni]]
*[[Term:Quarantine/it|Quarantena]]
*[[Term:SARS-CoV-2/it|SARS-CoV-2]]
*[[Term:Symptomatic/it|Sintomatico]]
*[[Term:Vaccination rate/it|Tasso di copertura vaccinale]]
*[[Term:Infection rate/it|Tasso di infezione]]
*[[Term:Intensive care/it|Terapia intensiva]]
*[[Term:Rapid diagnostic test/it|Test antigenico]]
*[[Term:RT-PCR test/it|Test molecolare]]
*[[Term:Virus transmission/it|Trasmissione]]
*[[Term:Vaccinate/it|Vaccinare]]
*[[Term:Coronavirus vaccine/it|Vaccino]]
*[[Term:MRNA vaccine/it|Vaccino a mRNA]]
*[[Term:Variant of SARS-CoV-2/it|Varianti di SARS-CoV-2]]
*[[Term:Ventilator/it|Ventilatore]]
*[[Term:Mechanical ventilation/it|Ventilazione meccanica]]</div>
==Hasła polskie==
<div class="wrapper" style="column-count:3">
*[[Term:Anti-vaxxer/pl|Antyszczepionkowiec]]
*[[Term:Asymptomatic/pl|Bezobjawowy]]
*[[Term:Big pharma/pl|Big pharma]]
*[[Term:Close contact/pl|Bliski kontakt]]
*[[Term:Severe course of COVID-19/pl|Ciężki przebieg COVID-19]]
*[[Term:COVID-19/pl|COVID-19]]
*[[Term:Digital COVID certificate/pl|Cyfrowe zaświadczenie COVID-19]]
*[[Term:Dose/pl|Dawka]]
*[[Term:Positive/pl|Dodatni]]
*[[Term:Social distancing/pl|Dystans społeczny]]
*[[Term:Wave of pandemic/pl|Fala pandemii]]
*[[Term:Hospitalization/pl|Hospitalizacja]]
*[[Term:Intensive care/pl|Intensywna opieka medyczna]]
*[[Term:Covidiot/pl|Kowidiota]]
*[[Term:Quarantine/pl|Kwarantanna]]
*[[Term:Lockdown/pl|Lockdown]]
*[[Term:Face mask/pl|Maseczka]]
*[[Term:Mechanical ventilation/pl|Mechaniczna wentylacja]]
*[[Term:Symptomatic/pl|Objawowy]]
*[[Term:Restrictions/pl|Obostrzenia]]
*[[Term:Vaccine mandate/pl|Obowiązek szczepień]]
*[[Term:COVID-19 pandemic/pl|Pandemia COVID-19]]
*[[Term:Vaccination rate/pl|Poziom wyszczepienia]]
*[[Term:Vaccination programme/pl|Program szczepień]]
*[[Term:Antibody/pl|Przeciwciało]]
*[[Term:Ventilator/pl|Respirator]]
*[[Term:SARS-CoV-2/pl|SARS-CoV-2]]
*[[Term:Vaccinate/pl|Szczepić]]
*[[Term:MRNA vaccine/pl|Szczepionka mRNA]]
*[[Term:Coronavirus vaccine/pl|Szczepionka przeciwko SARS-CoV-2]]
* [[Term:Rapid diagnostic test/pl|Szybki test]]
*[[Term:RT-PCR test/pl|Test RT-PCR]]
*[[Term:Virus transmission/pl|Transmisja wirusa]]
*[[Term:Booster shot/pl|Trzecia dawka]]
*[[Term:Negative/pl|Ujemny]]
*[[Term:Variant of SARS-CoV-2/pl|Wariant SARS-CoV-2]]
*[[Term:Infection rate/pl|Wskaźnik zakażeń]]
*[[Term:Proof of vaccination/pl|Zaświadczenie o szczepieniu]]</div>

Latest revision as of 06:21, 30 July 2022

Welcome to TermiKnowledge!

This is a multilingual terminological knowledge base about COVID-19 developed as an educational project within the framework of the 4EU+ Alliance, co-funded by the Erasmus+ Programme of the European Union.

The knowledge base covers almost 40 parallel terminological entries in five languages: English, Czech, German, Italian, and Polish. They were compiled on the basis of parallel corpora of various text genres. You can choose an entry from one of the lists below or search for it using the search box.

See more:


Entry lists

English entries

Czech entries

Deutsche Artikel

Italian entries

Hasła polskie