17 research outputs found
GERMAN LOANWORDS IN CROATIAN NEWSPAPER CORPORA
In this paper German loanwords that have been found automatically through detection in four widely circulated Croatian newspapers (Slobodna dalmacija, Novi list, VeÄernji list and Glas Slavonije) have been researched during a period of one year. The research is based on written, that is, digital sources ā the newspapers, which have been automatically searched for German loanwords. The list of German loanwords itself has been compiled by their listing from all the relevant written sources ā dictionaries and scientific papers. Even though the German language historically had quite a large influence on some parts of Croatia, the research has shown that German loanwords occur less than expected in contemporary newspaper texts. Their occurrence in the Croatian language is, however, systematic and it is possible to detect them by using automatic methods
German Loanwords in Digital Environment
Okosnicu istraživanja opisanom u Älanku Äini popis germanizama formiran iz rjeÄnika
hrvatskoga jezika, rjeÄnika stranih rijeÄi i doktorskih disertacija koje se bave prouÄavanjem
germanizama u hrvatskom jeziku. Pripremljeni popis germanizama omoguÄio je njihovu raÄunalnu
analizu u medijskom prostoru, odnosno hrvatskom mrežnom korpusu hrWaC-u, koji
obuhvaÄa tekstove objavljene na web-u u razdoblju od 4 godine. JeziÄne tehnologije potpomognute
naknadnom ruÄnom obradom podataka omoguÄile su utvrÄivanje uÄestalosti koriÅ”tenja
germanizama u pisanim tekstovima.
Metodama automatske detekcije germanizama pronaŔli smo germanizme u svim njihovim oblicima
koji su danas u uporabi u suvremenim tekstovima, Äime je ujedno stvoren i Äestotni
rjeÄnik germanizama.
Provedenim istraživanjem utvrÄena je zastupljenost germanizama u tekstovima suvremenog
hrvatskog jezika. Iako ukupni broj germanizama koji se danas pojavljuje u hrvatskim rjeÄnicima
i ostalim konzultiranim izvorima iz kojih je formiran osnovni popis iznosi 17 988 lema, u
uporabi je u suvremenim tekstovima dokazano 8 400 lema. PotvrÄenih 8 400 lema dokaz su
tomu da se leksiÄko blago hrvatskog jezika zabilježeno u rjeÄnicima nije izgubilo u suvremenom
hrvatskom jeziku, nego je sustavno uŔlo u korpus hrvatskog jezika te postoji u tekstovima
koji nisu nužno standardnojeziÄni.The framework of this research make German loan words compiled from the dictionaries of
the Croatian language, dictionaries of foreign words, and PhD thesis studying German loanwords
in the Croatian language. The list of German loanwords found in the contemporary
Croatian texts was analised using linguistic technologies supported by subsequent manual data
processing. The compiled list of gathered loan words allowed for their computational analysis
in the hrWaC web corpus comprising the texts from the whole .hr domain collected in a period
of four years.
This research has established the presence of the German loan words in contemporary Croatian
language texts. Although the total number of German loan words that appeared in contemporary
dictionaries of the Croatian language and other sources consulted during the research was
17 988 lemmas, we were able to confirm the usage for only 8 400 lemmas in contemporary
texts.
Applying the automatic detection method, we have found all German loan words used in
contemporary texts in all of their forms, simultaneously creating the German loan words frequency
dictionary. The confirmed 8 400 lemmas serve as proof that the lexical treasure of the
Croatian language recorded in dictionaries has not been lost in the contemporary Croatian
language. On the contrary, it has systematically entered into the web corpus of the Croatian
language and is found in the texts that do not necessarily belong to the standard language
German loanwords in the Croatian digital newspaper corpora
The object of the dissertation titled German loanwords in the Croatian digital newspaper corpora was to explore and analyze German loanwords in the contemporary Croatian texts using linguistic technologies supported by subsequent manual data processing.
Regarding German loanwords in Croatian language, German-Croatian contacts that left their traces in Croatian can be traced from the prehistoric time (Zepic, 2002).
As a consequence of political constellations in European history, Croatian language was under the influence of the German language until the 20th century. Due to the intense political and economic contacts, German loanwords entered the Croatian language on a large scale, with the largest acquisitions between 1527 and 1868, when Croatia was part of the Habsburg Monarchy.
On the other hand, from the early standardization phase of the Croatian language up to date, the process of language purification is being conducted, which reduced the number of German loanwords in the standard Croatian language.
But, for many years, German had an important role in Croatian language and its presence was especially noticeable in daily newspapers that were published in German in various Croatian cities. Publishing of newspapers in German during the history shows the importance and role of German language in Croatia, just as the contemporary daily press tends to be the reflection of the actual state of language.
Using language technologies, the research conducted as part of this PhD dissertation revealed to what extent German loanwords are used in Croatian language (i.e., the representation of German loanwords in contemporary Croatian language texts). The method analyzed the appearance of German loanwords in the Croatian corpora and regional daily newspapers.
Firstly, we have determined which German loanwords appear in the contemporary Croatian language, and as a source for compiling such a list we used dictionaries of standard Croatian language, dictionaries of foreign words, regional dictionaries and PhD thesis studying German loanwords in the Croatian language (nine sources in total). The analysis of these sources provided a list of 17.988 different German loanwords. The further comparative computer analysis of these sources showed that all German loanwords were not recorded in all sources. Many German loanwords were recorded in a single source (e.g. Petrovic's Eseker dictionary, which records German words from the Eseker dialect). The compiled list of German loanwords allowed for their computational analysis in the digitized daily press (namely four newspapers collected during the period of twelve months), as well as in the hrWaC web corpus comprising the texts from the whole .hr domain collected over a period of four years.
Searching hrWaC, we found 8.400 German loanwords out of 17.988 compiled from the nine sources. These 8.400 lemmas were found in 786.356 text files in their basic form (nominative case for nouns and adjectives, infinitive form for verbs) or in some other wordform.
The manual analysis of these 8.400 German loanwords showed that some wordforms of these German loanwords were identical to some Croatian words (i.e, they were homonyms). In some cases, these "fake" Germanisms had significantly higher frequency than the "real" German loanwords. Analysis of German loanwords allowed for the classification of German loanwords into two basic groups. The criterion for the classification was the overlap of German loanwords with Croatian language words, in one or all inflectional forms, or lack of such overlap.
The first group consists of German loanwords that have a meaning or meanings that bind to the original German word, which were (phonetically and/or morphologically) adapted to Croatian language, but still represent German loanwords.
The second group consists of those German loanwords that overlap with some Croatian words (i.e. they represent homonyms). This second group is divided into two sub-groups: the first sub-group consists of homonyms which belong to the same word class (e.g., "grad"- German loanword meaning "level" vs. "grad"- Croatian word meaning "city") while the second subgroup consists of homonyms that belong to different word classes (e.g., "lak"- German loanword meaning "varnish" vs. "lak"- Croatian word meaning "easy"), ie. the noun vs. the adjective).
Furthermore, the computer analysis of the frequency of German loanwords in daily newspapers showed that most of them appeared in Rijeka's Novi List. This newspaper is followed by Slobodna Dalmacija and Vecernji list, with a relatively equal number of German loanwords. The lowest frequency of German loanwords was determined in the daily newspaper Glas Slavonije.
In the period of twelve months we found a total of 191 German loanwords in four different daily newspapers: Novi List, Slobodna Dalmacija, Vecernji list and Glas Slavonije..
Given the cross-section of German loanwords found in daily newspapers (191 German loanwords) and hrWaC web corpus, all German loanwords found in newspapers have been confirmed in hrWaC web corpus, since hrWaC contains texts from the entire .hr domain, including digital editions of daily newspapers.
Furthemore, language technologies enabled us to determine how often German loanwords appear in contemporary texts - Croatian hrWaC web corpus and regional daily newspapers. Both sources are considered contemporary, since the newspaper and the corpus mirror the everyday use of the Croatian language.
This thesis gives the comparative analysis of German loanwords collected from various sources (dictionaries, scientific papers), revealing the overlap and the overlap frequency in all sources as well as the overlap frequency of German loanwords from written sources and contemporary digital texts (daily newspapers and corpus).
The study found that German loanwords that are most frequently found in web corpus originate from dictionaries of foreign words. The analysis of the frequency of German loanwords from the "New dictionary of foreign words" by Bratoljub KlaiÄ (2012) and "Dictionary of Foreign Words" by Anic, Klaic & Domovic (2001) showed that 965 German loanwords from these sources had frequency in hrWaC-in greater than 1,000.
This dissertation succeeded to built a mechanism for the analysis of German loanwords in Croatian language that can be applied to other languages. It is important to point out that regardless of the computer preparation and processing, much of the analysis had to be made manually.
The analysis of the frequency of German loanwords in the contemporary Croatian language resulted in a frequency dictionary of German loanwords in the Croatian language.
Out of four hypotheses, two were proven, one only partially, and one was not proven.
The hypothesis that German loanwords are systematically used in Croatian language is partially proven. The analysis revealed that out of 17.988 German loanwords collected from all written sources, 47% were found in hrWaC web corpus. However, out of this percentage, only 1.285 German loanwords occured in hrWaC with a frequency greater than 1000. Also, during the period of one year only 191 German loanwords were found in four daily newspapers (18.284 tokens). It was concluded that German loanwords in Croatian language appear systematically, but less than we would expect given the centuries of intensive influence of the German language in our country. The hypothesis that German loanwords without their equivalent in the Croatian language appear more often than German loanwords that have their Croatian equivalent was proven. Namely, 8.400 German loanwords that were found in the web corpus where additionally analyzed manually. It turned out that 52% of German loanwords that were analyzed manually has a Croatian equivalent.
The third hypothesis, stating that German loanwords collected from the standard dictionaries of the Croatian language appear uniformly in the regional editions of daily newspapers regardless of the region, was also proven.
The hypothesis is proven by analyzing digitized daily press over a period of twelve months. Statistics showed that German loanwords occur evenly in the newspapers from all regions. As for the frequency, German loanwords usually occur in the body of the text, followed by the main title, the heading and the sub-headings.
The hypothesis that German loanwords in some newspaper sections appear more frequently, indicating a specific use of German loanwords in certain fields of human activity, has not been proven. Digital processing of newspapers did not enable analysis of German loanwords by section, or the determination of specific applications of German loanwords in certain fields of human endeavor. The analysis of web corpus also did not allow searching by fields of human endeavor, since all texts were saved according to the URLs of the domain.
Finally, we can conclude that this research has established the presence of German loanwords in contemporary Croatian language texts. Although the total number of German loanwords that appeared in contemporary dictionaries of the Croatian language and other sources consulted during the research was 17.988 lemmas, we were able to confirm the usage for only 8.400 lemmas in contemporary texts.
Doktorska disertacija pod naslovom Germanizmi u digitalnim novinskim korpusima hrvatskoga jezika istražila je germanizme u suvremenim hrvatskim tekstovima koristeÄi jeziÄne tehnologije potpomognute naknadnom ruÄnom obradom podataka. Okosnicu istraživanja Äini popis germanizama formiran iz rjeÄnika hrvatskoga jezika, rjeÄnika stranih rijeÄi i doktorskih disertacija koje se bave prouÄavanjem germanizama u hrvatskom jeziku.
Pripremljeni popis germanizama omoguÄio je njihovu raÄunalnu analizu u digitaliziranom dnevnom tisku, Äetri novinska izdanja kroz period od godinu dana, te u web korpusu hrWaC-u koji obuhvaÄa tekstove objavljenje na webu u periodu od 4 godine.
Ovim istraživanjem utvrdili smo zastupljenost germanizama u tekstovima suvremenog hrvatskog jezika. Iako ukupni broj germanizama koji se danas pojavljuju u hrvatskim rjeÄnicima i ostalim konzultiranim izvorima iz kojih je formiran osnovni popis iznosi 17.988 lema, u uporabi je u suvremenim tekstovima dokazano 8.400 lema.
Metodama automatske detekcije germanizama pronaÅ”li smo germanizme u svim njihovim oblicima koji su danas u uporabi u suvremenim tekstovima, Äime je ujedno stvoren i Äestotni rjeÄnik germanizama.
Detaljnom analizom utvrÄeno je da se germanizmi u hrvatskom jeziku pojavljuju sustavno, ali rjeÄe no Å”to bismo oÄekivali s obzirom na stoljetni intenzivni utjecaj njemaÄkog jezika u naÅ”im krajevima. PotvrÄenih 8.400 lema dokaz su da se leksiÄko blago hrvatskog jezika zabilježeno u rjeÄnicima nije izgubilo u suvremenom hrvatskom jeziku, nego je sustavno uÅ”lo u korpus hrvatskog jezika te postoji u tekstovima koji nisu nužno standardnojeziÄni.
Dokazano je da se germanizmi bez svojeg ekvivalenta u hrvatskom jeziku pojavljuju ÄeÅ”Äe od germanizama koji imaju hrvatski ekvivalent.
KonaÄno, dokazano je da se germanizmi uvrÅ”teni u standardne rjeÄnike hrvatskoga jezika u regionalnim izdanjima dnevnih novina pojavljuju ravnomjerno bez obzira na regiju, Å”to znaÄi da nisu nužno regionalno obojeni.The object of the study of the dissertation titled German loan words in the Croatian digital newspaper corpora are the German loan words found in the contemporary Croatian texts using linguistic technologies supported by subsequent manual data processing. A list of German loan words compiled from the dictionaries of the Croatian language, dictionaries of foreign words, and PhD thesis studying German loan words in the Croatian language constitutes the framework of this research.
The compiled list of German loan words allowed for their computational analysis in the digitized daily press (namely four newspapers collected during the period of one year), as well as in the hrWaC web corpus comprising the texts from the whole .hr domain collected in a period of four years.
This research has established the presence of the German loan words in contemporary Croatian language texts. Although the total number of German loan words that appeared in contemporary dictionaries of the Croatian language and other sources consulted during the research was 17988 lemmas, we were able to confirm the usage for only 8400 lemmas in contemporary texts.
Applying the automatic detection method, we have found all German loan words used in contemporary texts in all of their forms, simultaneously creating the German loan words frequency dictionary. A detailed analysis has revealed that German loan words occur systematically in the Croatian language, but less frequently than expected, considering substantial, centuries-old influence of the German language in the Croatian regions. The confirmed 8400 lemmas serve as proof that the lexical treasure of the Croatian language recorded in dictionaries has not been lost in the contemporary Croatian language. On the contrary, it has systematically entered into the web corpus of the Croatian language and is found in the texts that do not necessarily belong to the standard language.
It has been proven that the German loan words without an equivalent in the Croatian language occur more frequently in the language than the German loan words with their Croatian equivalent. Finally, it has been proven that German loan words that are part of the standard dictionaries of the Croatian language, are equally distributed in regional issues of daily newspapers, regardless of the region, which suggests that their usage is not necessarily region specific
MJERENJE BRZINE RADA PHP MODULA
This paper presents how C programming language could be used for this type of tasks, created as PHP module and then get imported in the PHP language. The purpose of this paper is to show how and when is better to build PHP modules in C, instead of normal PHP functions, and to show negative sides of this type of programming. Profiling, as an important aspect of finding application bottlenecks, is also discussed. Profiling systems like Xdebug, Qcachegrind and Webgrind are also elucidated. This paper contains source code which calculates Fibonacci sequence and multiplies 800x800 matrices written in C and PHP programming languages. Results show that whenever there is a need for a mathematical computation, C will be many times faster and that it is much more cost-effective to write such code in C and create a PHP module.Ovaj rad predstavlja kako se za ovakve probleme može koristiti programski jezik C, kao PHP modul, te pozvati iz PHP jezika. Cilj ovog ujedno je pokazati kako i kada je bolje, umjesto obiÄnih PHP funkcija, raditi PHP module u programskom jeziku C te prikazati i negativne strane takvoga programiranja. Rad objaÅ”njava i pronalazak āuskog-grlaā u aplikacijama pomoÄu sustava kao Å”to su Xdebug, qcachegrind i webgrind. U radu je prikazan izvorni kod koji raÄuna Fibonaccijev niz te množi matrice veliÄine 800x800 u programskim jezicima C i PHP. Dobiveni rezultati pokazuju da je u svim sluÄajevima gdje je bio potreban matematiÄki izraÄun, programski jezik C bio viÅ”estruko brži te da je u takvim sluÄajevima isplativije napisati izvorni kod u programskom jeziku C i izraditi PHP modul
MJERENJE BRZINE RADA PHP MODULA
This paper presents how C programming language could be used for this type of tasks, created as PHP module and then get imported in the PHP language. The purpose of this paper is to show how and when is better to build PHP modules in C, instead of normal PHP functions, and to show negative sides of this type of programming. Profiling, as an important aspect of finding application bottlenecks, is also discussed. Profiling systems like Xdebug, Qcachegrind and Webgrind are also elucidated. This paper contains source code which calculates Fibonacci sequence and multiplies 800x800 matrices written in C and PHP programming languages. Results show that whenever there is a need for a mathematical computation, C will be many times faster and that it is much more cost-effective to write such code in C and create a PHP module.Ovaj rad predstavlja kako se za ovakve probleme može koristiti programski jezik C, kao PHP modul, te pozvati iz PHP jezika. Cilj ovog ujedno je pokazati kako i kada je bolje, umjesto obiÄnih PHP funkcija, raditi PHP module u programskom jeziku C te prikazati i negativne strane takvoga programiranja. Rad objaÅ”njava i pronalazak āuskog-grlaā u aplikacijama pomoÄu sustava kao Å”to su Xdebug, qcachegrind i webgrind. U radu je prikazan izvorni kod koji raÄuna Fibonaccijev niz te množi matrice veliÄine 800x800 u programskim jezicima C i PHP. Dobiveni rezultati pokazuju da je u svim sluÄajevima gdje je bio potreban matematiÄki izraÄun, programski jezik C bio viÅ”estruko brži te da je u takvim sluÄajevima isplativije napisati izvorni kod u programskom jeziku C i izraditi PHP modul
Digital transformation of academic writing
Scientific and professional research and writing academic papers have always been a challenge for
people with experience and for those who are just starting with these activities. Nowadays, when
the digital transformation in this field has reached its full bloom, we have various digital tools at our
disposal to help us in research and writing academic papers in an organized way. Digital technologies
have created a new ecosystem for research, both for searching and accessing information as well as
for writing with the possibility of mutual collaboration among authors. The paper highlights the
major categories of digital tools and describes the most important ones in more detail. Digital tools
help authors throughout the research process and in writing academic papers. These include thinking
about an idea, taking notes from the literature, citing, adding graphical elements, and collaborating
with colleagues in real time. A new paradigm in writing academic papers with the help of artificial
intelligence was briefly discussed, which includes the use of language models of natural language, and
text processing with a critical review of the ethics of using these tools was briefly discussed.
The research was conducted on the use of digital tools in two populations: teachers and students. The
obtained research results were presented and discussed, after which appropriate conclusions were
drawn
Digital transformation of academic writing
Scientific and professional research and writing academic papers have always been a challenge for
people with experience and for those who are just starting with these activities. Nowadays, when
the digital transformation in this field has reached its full bloom, we have various digital tools at our
disposal to help us in research and writing academic papers in an organized way. Digital technologies
have created a new ecosystem for research, both for searching and accessing information as well as
for writing with the possibility of mutual collaboration among authors. The paper highlights the
major categories of digital tools and describes the most important ones in more detail. Digital tools
help authors throughout the research process and in writing academic papers. These include thinking
about an idea, taking notes from the literature, citing, adding graphical elements, and collaborating
with colleagues in real time. A new paradigm in writing academic papers with the help of artificial
intelligence was briefly discussed, which includes the use of language models of natural language, and
text processing with a critical review of the ethics of using these tools was briefly discussed.
The research was conducted on the use of digital tools in two populations: teachers and students. The
obtained research results were presented and discussed, after which appropriate conclusions were
drawn
Costume Design In Three Spectral Areas
This paper presents the innovative possibilities of graphic technology for use in visual scenic activities. The individual values of costumes and scenography in the ultraviolet, visible and infrared spectrum are reproduced with new graphic technologies. Each element of the costumes becomes visible when illuminated with different values of wavelengths of light observed through modified cameras. With the addition of black light in the lighting design, the viewer sees only the ultraviolet element. In the absence of black light the costumes reveal only the design created in the area visible to the bare eye. The set-design elements created in infrared values, invisible to the bare eye are shown with a projection, simultaneously. The presented technique represents a new means of expression in all forms of visual art
Development of a custom template for Wordpress system
U ovom radu objasnili smo ideju i razvoj WordPress sustava za upravljanje sadržajem. Ukazali smo na prednost i moguÄnosti koriÅ”tenja prilagoÄenog predloÅ”ka za specifiÄno web rjeÅ”enje. Na osnovu dva pristupa objasnili smo kako razviti prilagoÄeni predložak. Prikazali smo osnovne faze razvoja prilagoÄenog predloÅ”ka klasiÄnim pristupom te modernim pristupom gdje se za razvoj predloÅ”ka koristi WordPress REST API i Vue.js framework. Rubno smo u radu dotakli i na koji naÄin dodaci mogu olakÅ”ati i poboljÅ”ati web stranicu.The authors have explained the idea and development of a WordPress content management system. The benefits and capabilities of using a custom template for a specific web solution have been highlighted. We have explained how to develop a custom template based on two approaches. The basic stages of developing a custom template with the classic approach and a modern approach have been displayed where the WordPress REST API and Vue.js framework are being used to develop the template. We have also mentioned how plugins can facilitate and enhance a website
APPLICATION OF DRUPAL CMS IN BUILDING WEB SYSTEMS
U radu Äe se izvrÅ”iti analiza i usporedba Drupal-a
sa sliÄnim sustavima i tehnologijama. Usporedba
Äe se napraviti sa trenutno najpopularnijim
sustavima iz podruÄja u kojima se Drupal koristi,
a to su WordPress, Joomla, Magento i custom
razvoj aplikacija. Kroz usporedbu Äe se prikazati
prednosti i mane pojedinog sustava koje uzrokuju
veÄu ili manju primjenu istoga. Na kraju analize
izvuÄi Äe se dobiveni zakljuÄak te postaviti uvid u
smjer kojim Äe se odvijati daljnji razvoj pojedinog
sustava.The paper will present comparison and analysis of
Drupal with similar systems and technologies. A
comparison will be made with the currently most
popular systems in the field in which Drupal is
used, such as WordPress, Joomla, Magento, and
custom application development. The comparison
will show the advantages and disadvantages of
each system that cause the system to be used more
or less. At the end of the analysis a conclusion
will be drawn out and an insight set up into the
direction of further development of individual
systems