28 research outputs found

    The Reference Corpus of Contemporary Portuguese and related resources

    Get PDF
    The extraordinary growth of computer applications, particularly over the last two decades, has enabled the easy compilation and exploration of large corpora and lexica. These linguistic resources play a fundamental role in the areas of theoretical linguistics and natural language engineering. Combining these two areas of knowledge can, in fact, result in the development of a large number of applications, such as new and straightforward descriptions of languages based on real data; contrastive studies between varieties of a particular language aiming at finding factors of unity and diversity; cross-linguistic contrastive studies; grammars; lexica and dictionaries; terminologies; assisted translation materials; language teaching materials; computer tools and applications for processing natural language. Having this principle in mind and following the tradition at the Centre of Linguistics of the University of Lisbon (CLUL)i of collecting and studying real language data, a large electronic corpus – the Corpus de Referência do Português Contemporâneo (Reference Corpus of Contemporary Portuguese, CRPC) – is being compiled at CLUL since 1988. The CRPC currently contains approximately 310 million words, searchable through a user-friendly interface, and it is envisaged as a monitor corpus (from which one can extract balanced subcorpora) that can serve as a sample of the Portuguese language (both in its written and spoken varieties). In the next sections, we will describe the CRPC and how it forms the basis for important resources developed at CLUL.info:eu-repo/semantics/publishedVersio

    A Lexical Database of Portuguese Multiword Expressions

    Get PDF
    This presentation focuses on an ongoing project which aims at the creation of a large lexical database of Portuguese multiword (MW) units, automatically extracted through the analysis of a balanced 50 million word corpus, statistically interpreted with lexical association measures and validated by hand. This database covers different types of MW units, like named entities, and lexical associations ranging from sets of favoured co-occurring forms with high corpus frequency and low cohesion to strongly lexicalized expressions with no, or minimum, variation. This new resource has a two-fold objective: to be an important research tool which supports the development of collocation typologies and their integration in a larger theory of MW units; to be of major help in developing and evaluating language processing tools able of dealing with MW expressions.info:eu-repo/semantics/publishedVersio

    Corpus-based extraction and identification of Portuguese Multiword Expressions

    Get PDF
    This presentation reports the methodology followed and the results attained on an on-going project aiming at building a large lexical database of corpus-extracted multiword (MW) expressions for the Portuguese language. MW expressions were automatically extracted from a balanced 50 million word corpus compiled for this project, furthermore statistically interpreted using lexical association measures and are undergoing a manual validation process. The lexical database covers different types of MW expressions, from named entities to lexical associations with different degrees of cohesion, ranging from totally frozen idioms to favoured co-occurring forms, like collocations. We aim to achieve two main objectives with this resource: to build on the large set of data of different types of MW expressions to revise existing typologies of collocations and to integrate them in a larger theory of MW units; to use the extensive hand-checked data as training data to evaluate existing statistical lexical association measures.Cet article présente la méthodologie suivie et les résultats obtenus dans le cadre d’un projet qui a pour objectif la construction d’une large base de données d’expressions multi-mots de la langue portugaise. Ces expressions multi-mots ont été automatiquement extraites d’un corpus équilibré de 50 millions de mots, interprétées statistiquement à l’aide de mesures d’association lexicales et ont été ensuite manuellement vérifiées. La base de données lexicales recouvre différent types d’expressions multi-mots avec différents degrés de cohésion, qui vont de la quasi totale fixité jusqu’aux groupes de mots qui se réalisent préférentiellement ensemble, comme les collocations. Le large ensemble de données de cette ressource permettra une révision des typologies d’unités multi-mots en portugais et l’évaluation de différentes mesures d’associations lexicales.info:eu-repo/semantics/publishedVersio

    COMBINA-PT: a Large Corpus-extracted and Hand-checked Lexical Database of Portuguese Multiword Expressions

    Get PDF
    This paper presents the COMBINA-PT project, a study of corpus-extracted Portuguese Multiword (MW) expressions. The objective of this on-going project is to compile a large lexical database of multiword (MW) units of the Portuguese language, automatically extracted from a balanced 50 million word corpus, interpreted with lexical association measures and manually validated. MW expressions considered in the database include named entities and lexical associations with different degrees of cohesion, ranging from frozen groups, which undergo little or no variation, to lexical collocations composed of words that tend to occur together and that constitute syntactic dependencies, although with a low degree of fixedness. This new resource has a two-fold objective: (i) to be an important research tool which supports the development of MW expressions typologies and their lexicographic treatment; (ii) to be of major help in developing and evaluating language processing tools able of dealing with MW expressionsinfo:eu-repo/semantics/publishedVersio

    A history of the Arabic language and the origin of non-dominant varieties of Arabic

    Get PDF
    To comprehend how Arabic became a pluricentric language, we need to navigate through its rich history. In this paper, I focus on three stages in the development of Arabic: Classical Arabic, Middle Arabic and Modern Arabic. I explain how the fate of Arabic was permanently sealed in the Classical period with the emergence of Islam and the subsequent Islamic conquests. At the peak of the Islamic empire, the codification of Arabic preserved it as a dominant written language. However, the indigenous languages which Arabic had displaced in new regions gave way to non-dominant regional varieties. These varieties continued to diverge from the codified variety during the Middle period, giving rise to diglossia in Arabic. I conclude with a review of the modern period and the Arabic revival efforts which marked the creation of Modern Standard Arabic while the colonially influenced non-dominant varieties drifted further still

    CQPWeb: Uma nova plataforma de pesquisa para o CRPC

    Get PDF
    We present a newly available online resource for Portuguese, a new version of the Reference Corpus of Contemporary Portuguese, now searchable via a user-friendly web interface. We report on work carried out on the corpus previous to its publication online, namely how the corpus was built, our choice of metadata and the processes and tools involved for the cleaning, preparation and annotation to make the corpus suitable for linguistic inquiries. We also describe the web platform and resume the extensive search options available for linguistic or NLP studies.info:eu-repo/semantics/publishedVersio

    Productivity and nutritional aspects in Moringa oleifera plants fertilized with NPK in south-eastern Brazil

    Get PDF
    Moringa oleifera is an Asian tree species cultivated for its nutritional and medicinal properties. Present study was carried out during 2021 and 2022 at Federal Institute of Minas Gerais, São João Evangelista-MG, Doce River Valley, South-Eastern Brazil to determine NPK doses for Moringa and critical NPK levels in the shoot. The experimental design was a fractional factorial of the type (1/2)43, with 32 treatments set up in four blocks. The treatments represented combinations of doses of N (0, 40, 80, 160 kg/ha); P2O5 (0, 45, 90, 180 kg/ha); and K2O (0, 20, 40, 80 kg/ha). Each experimental unit was composed of four seedlings arranged in a row. Plant shoot harvests were carried out every 60 days (January, March, May, July, September and November) for two consecutive years. The data were analyzed using regression analysis, and the adjusted response surface model was: Y = b0 + b1 N + b2 N2 + b3 P + b4 P2 + b5 K + b6 K2 + b7 NP + b8 NK + b9 PK. The productivity of fresh and dry matter of the shoot of M. oleifera plants was influenced only by N doses in the first year of cultivation, when the supply of 88.7 and 85.8 kg/ha of N led to the maximum estimated productivity for fresh (10,360.3 kg/ha) and dry matter (2,84.0 kg/ha). The critical contents in the shoot were 3.23 dag/kg (N), 0.41 dag/kg (P), and 2.18 dag/kg (K), while the annual extraction by plants was 78.44 kg/ ha (N), 8.99 kg/ha (P), and 48.81 kg/ha (K)

    Zika: abordagem clínica na atenção básica

    Get PDF
    Zika é uma doença que foi detectada no país no último ano, a partir deste evento a doença tem se disseminado no país, cursando de forma inédita segundo a literatura científica. Tendo encontrado ambiente favorável à sua disseminação, que é a presença do vetor Aedes em todo o país, em população sem imunidade à doença, vem causando enorme impacto à saúde de nossa população. É preciso que os profissionais de saúde se capacitem para conseguir minimizar o impacto desta enfermidade, utilizando todos os recursos possíveis para assistir, disseminar os conhecimentos para a população, além de construir parcerias com todos os equipamentos sociais para atuarem no sentido de proteger a saúde de todos. Para isso este material foi elaborado, tendo o caráter auto-instrucional, os profissionais de saúde podem realizá-lo dentro de suas possibilidades. O módulo tem 45h, sendo dividido em quatro unidades de ensino; ao final oferece uma avaliação objetiva e a certificação on-line. Na biblioteca estão disponibilizados livros e vídeos com conteúdos referentes ao tema, utilize-os se sentir necessidade de aprofundar seus conhecimentos.1.

    Taking the pulse of Earth's tropical forests using networks of highly distributed plots

    Get PDF
    Tropical forests are the most diverse and productive ecosystems on Earth. While better understanding of these forests is critical for our collective future, until quite recently efforts to measure and monitor them have been largely disconnected. Networking is essential to discover the answers to questions that transcend borders and the horizons of funding agencies. Here we show how a global community is responding to the challenges of tropical ecosystem research with diverse teams measuring forests tree-by-tree in thousands of long-term plots. We review the major scientific discoveries of this work and show how this process is changing tropical forest science. Our core approach involves linking long-term grassroots initiatives with standardized protocols and data management to generate robust scaled-up results. By connecting tropical researchers and elevating their status, our Social Research Network model recognises the key role of the data originator in scientific discovery. Conceived in 1999 with RAINFOR (South America), our permanent plot networks have been adapted to Africa (AfriTRON) and Southeast Asia (T-FORCES) and widely emulated worldwide. Now these multiple initiatives are integrated via ForestPlots.net cyber-infrastructure, linking colleagues from 54 countries across 24 plot networks. Collectively these are transforming understanding of tropical forests and their biospheric role. Together we have discovered how, where and why forest carbon and biodiversity are responding to climate change, and how they feedback on it. This long-term pan-tropical collaboration has revealed a large long-term carbon sink and its trends, as well as making clear which drivers are most important, which forest processes are affected, where they are changing, what the lags are, and the likely future responses of tropical forests as the climate continues to change. By leveraging a remarkably old technology, plot networks are sparking a very modern revolution in tropical forest science. In the future, humanity can benefit greatly by nurturing the grassroots communities now collectively capable of generating unique, long-term understanding of Earth's most precious forests.Additional co-authors: Susan Laurance, William Laurance, Francoise Yoko Ishida, Andrew Marshall, Catherine Waite, Hannsjoerg Woell, Jean-Francois Bastin, Marijn Bauters, Hans Beeckman, Pfascal Boeckx, Jan Bogaert, Charles De Canniere, Thales de Haulleville, Jean-Louis Doucet, Olivier Hardy, Wannes Hubau, Elizabeth Kearsley, Hans Verbeeck, Jason Vleminckx, Steven W. Brewer, Alfredo Alarcón, Alejandro Araujo-Murakami, Eric Arets, Luzmila Arroyo, Ezequiel Chavez, Todd Fredericksen, René Guillén Villaroel, Gloria Gutierrez Sibauty, Timothy Killeen, Juan Carlos Licona, John Lleigue, Casimiro Mendoza, Samaria Murakami, Alexander Parada Gutierrez, Guido Pardo, Marielos Peña-Claros, Lourens Poorter, Marisol Toledo, Jeanneth Villalobos Cayo, Laura Jessica Viscarra, Vincent Vos, Jorge Ahumada, Everton Almeida, Jarcilene Almeida, Edmar Almeida de Oliveira, Wesley Alves da Cruz, Atila Alves de Oliveira, Fabrício Alvim Carvalho, Flávio Amorim Obermuller, Ana Andrade, Fernanda Antunes Carvalho, Simone Aparecida Vieira, Ana Carla Aquino, Luiz Aragão, Ana Claudia Araújo, Marco Antonio Assis, Jose Ataliba Mantelli Aboin Gomes, Fabrício Baccaro, Plínio Barbosa de Camargo, Paulo Barni, Jorcely Barroso, Luis Carlos Bernacci, Kauane Bordin, Marcelo Brilhante de Medeiros, Igor Broggio, José Luís Camargo, Domingos Cardoso, Maria Antonia Carniello, Andre Luis Casarin Rochelle, Carolina Castilho, Antonio Alberto Jorge Farias Castro, Wendeson Castro, Sabina Cerruto Ribeiro, Flávia Costa, Rodrigo Costa de Oliveira, Italo Coutinho, John Cunha, Lola da Costa, Lucia da Costa Ferreira, Richarlly da Costa Silva, Marta da Graça Zacarias Simbine, Vitor de Andrade Kamimura, Haroldo Cavalcante de Lima, Lia de Oliveira Melo, Luciano de Queiroz, José Romualdo de Sousa Lima, Mário do Espírito Santo, Tomas Domingues, Nayane Cristina dos Santos Prestes, Steffan Eduardo Silva Carneiro, Fernando Elias, Gabriel Eliseu, Thaise Emilio, Camila Laís Farrapo, Letícia Fernandes, Gustavo Ferreira, Joice Ferreira, Leandro Ferreira, Socorro Ferreira, Marcelo Fragomeni Simon, Maria Aparecida Freitas, Queila S. García, Angelo Gilberto Manzatto, Paulo Graça, Frederico Guilherme, Eduardo Hase, Niro Higuchi, Mariana Iguatemy, Reinaldo Imbrozio Barbosa, Margarita Jaramillo, Carlos Joly, Joice Klipel, Iêda Leão do Amaral, Carolina Levis, Antonio S. Lima, Maurício Lima Dan, Aline Lopes, Herison Madeiros, William E. Magnusson, Rubens Manoel dos Santos, Beatriz Marimon, Ben Hur Marimon Junior, Roberta Marotti Martelletti Grillo, Luiz Martinelli, Simone Matias Reis, Salomão Medeiros, Milton Meira-Junior, Thiago Metzker, Paulo Morandi, Natanael Moreira do Nascimento, Magna Moura, Sandra Cristina Müller, Laszlo Nagy, Henrique Nascimento, Marcelo Nascimento, Adriano Nogueira Lima, Raimunda Oliveira de Araújo, Jhonathan Oliveira Silva, Marcelo Pansonato, Gabriel Pavan Sabino, Karla Maria Pedra de Abreu, Pablo José Francisco Pena Rodrigues, Maria Piedade, Domingos Rodrigues, José Roberto Rodrigues Pinto, Carlos Quesada, Eliana Ramos, Rafael Ramos, Priscyla Rodrigues, Thaiane Rodrigues de Sousa, Rafael Salomão, Flávia Santana, Marcos Scaranello, Rodrigo Scarton Bergamin, Juliana Schietti, Jochen Schöngart, Gustavo Schwartz, Natalino Silva, Marcos Silveira, Cristiana Simão Seixas, Marta Simbine, Ana Claudia Souza, Priscila Souza, Rodolfo Souza, Tereza Sposito, Edson Stefani Junior, Julio Daniel do Vale, Ima Célia Guimarães Vieira, Dora Villela, Marcos Vital, Haron Xaud, Katia Zanini, Charles Eugene Zartman, Nur Khalish Hafizhah Ideris, Faizah binti Hj Metali, Kamariah Abu Salim, Muhd Shahruney Saparudin, Rafizah Mat Serudin, Rahayu Sukmaria Sukri, Serge Begne, George Chuyong, Marie Noel Djuikouo, Christelle Gonmadje, Murielle Simo-Droissart, Bonaventure Sonké, Hermann Taedoumg, Lise Zemagho, Sean Thomas, Fidèle Baya, Gustavo Saiz, Javier Silva Espejo, Dexiang Chen, Alan Hamilton, Yide Li, Tushou Luo, Shukui Niu, Han Xu, Zhang Zhou, Esteban Álvarez-Dávila, Juan Carlos Andrés Escobar, Henry Arellano-Peña, Jaime Cabezas Duarte, Jhon Calderón, Lina Maria Corrales Bravo, Borish Cuadrado, Hermes Cuadros, Alvaro Duque, Luisa Fernanda Duque, Sandra Milena Espinosa, Rebeca Franke-Ante, Hernando García, Alejandro Gómez, Roy González-M., Álvaro Idárraga-Piedrahíta, Eliana Jimenez, Rubén Jurado, Wilmar López Oviedo, René López-Camacho, Omar Aurelio Melo Cruz, Irina Mendoza Polo, Edwin Paky, Karen Pérez, Angel Pijachi, Camila Pizano, Adriana Prieto, Laura Ramos, Zorayda Restrepo Correa, James Richardson, Elkin Rodríguez, Gina M. Rodriguez M., Agustín Rudas, Pablo Stevenson, Markéta Chudomelová, Martin Dancak, Radim Hédl, Stanislav Lhota, Martin Svatek, Jacques Mukinzi, Corneille Ewango, Terese Hart, Emmanuel Kasongo Yakusu, Janvier Lisingo, Jean-Remy Makana, Faustin Mbayu, Benjamin Toirambe, John Tshibamba Mukendi, Lars Kvist, Gustav Nebel, Selene Báez, Carlos Céron, Daniel M. Griffith, Juan Ernesto Guevara Andino, David Neill, Walter Palacios, Maria Cristina Peñuela-Mora, Gonzalo Rivas-Torres, Gorky Villa, Sheleme Demissie, Tadesse Gole, Techane Gonfa, Kalle Ruokolainen, Michel Baisie, Fabrice Bénédet, Wemo Betian, Vincent Bezard, Damien Bonal, Jerôme Chave, Vincent Droissart, Sylvie Gourlet-Fleury, Annette Hladik, Nicolas Labrière, Pétrus Naisso, Maxime Réjou-Méchain, Plinio Sist, Lilian Blanc, Benoit Burban, Géraldine Derroire, Aurélie Dourdain, Clement Stahl, Natacha Nssi Bengone, Eric Chezeaux, Fidèle Evouna Ondo, Vincent Medjibe, Vianet Mihindou, Lee White, Heike Culmsee, Cristabel Durán Rangel, Viviana Horna, Florian Wittmann, Stephen Adu-Bredu, Kofi Affum-Baffoe, Ernest Foli, Michael Balinga, Anand Roopsind, James Singh, Raquel Thomas, Roderick Zagt, Indu K. Murthy, Kuswata Kartawinata, Edi Mirmanto, Hari Priyadi, Ismayadi Samsoedin, Terry Sunderland, Ishak Yassir, Francesco Rovero, Barbara Vinceti, Bruno Hérault, Shin-Ichiro Aiba, Kanehiro Kitayama, Armandu Daniels, Darlington Tuagben, John T. Woods, Muhammad Fitriadi, Alexander Karolus, Kho Lip Khoon, Noreen Majalap, Colin Maycock, Reuben Nilus, Sylvester Tan, Almeida Sitoe, Indiana Coronado G., Lucas Ojo, Rafael de Assis, Axel Dalberg Poulsen, Douglas Sheil, Karen Arévalo Pezo, Hans Buttgenbach Verde, Victor Chama Moscoso, Jimmy Cesar Cordova Oroche, Fernando Cornejo Valverde, Massiel Corrales Medina, Nallaret Davila Cardozo, Jano de Rutte Corzo, Jhon del Aguila Pasquel, Gerardo Flores Llampazo, Luis Freitas, Darcy Galiano Cabrera, Roosevelt García Villacorta, Karina Garcia Cabrera, Diego García Soria, Leticia Gatica Saboya, Julio Miguel Grandez Rios, Gabriel Hidalgo Pizango, Eurídice Honorio Coronado, Isau Huamantupa-Chuquimaco, Walter Huaraca Huasco, Yuri Tomas Huillca Aedo, Jose Luis Marcelo Peña, Abel Monteagudo Mendoza, Vanesa Moreano Rodriguez, Percy Núñez Vargas, Sonia Cesarina Palacios Ramos, Nadir Pallqui Camacho, Antonio Peña Cruz, Freddy Ramirez Arevalo, José Reyna Huaymacari, Carlos Reynel Rodriguez, Marcos Antonio Ríos Paredes, Lily Rodriguez Bayona, Rocio del Pilar Rojas Gonzales, Maria Elena Rojas Peña, Norma Salinas Revilla, Yahn Carlos Soto Shareva, Raul Tupayachi Trujillo, Luis Valenzuela Gamarra, Rodolfo Vasquez Martinez, Jim Vega Arenas, Christian Amani, Suspense Averti Ifo, Yannick Bocko, Patrick Boundja, Romeo Ekoungoulou, Mireille Hockemba, Donatien Nzala, Alusine Fofanah, David Taylor, Guillermo Bañares-de Dios, Luis Cayuela, Íñigo Granzow-de la Cerda, Manuel Macía, Juliana Stropp, Maureen Playfair, Verginia Wortel, Toby Gardner, Robert Muscarella, Hari Priyadi, Ervan Rutishauser, Kuo-Jung Chao, Pantaleo Munishi, Olaf Bánki, Frans Bongers, Rene Boot, Gabriella Fredriksson, Jan Reitsma, Hans ter Steege, Tinde van Andel, Peter van de Meer, Peter van der Hout, Mark van Nieuwstadt, Bert van Ulft, Elmar Veenendaal, Ronald Vernimmen, Pieter Zuidema, Joeri Zwerts, Perpetra Akite, Robert Bitariho, Colin Chapman, Eilu Gerald, Miguel Leal, Patrick Mucunguzi, Miguel Alexiades, Timothy R. Baker, Karina Banda, Lindsay Banin, Jos Barlow, Amy Bennett, Erika Berenguer, Nicholas Berry, Neil M. Bird, George A. Blackburn, Francis Brearley, Roel Brienen, David Burslem, Lidiany Carvalho, Percival Cho, Fernanda Coelho, Murray Collins, David Coomes, Aida Cuni-Sanchez, Greta Dargie, Kyle Dexter, Mat Disney, Freddie Draper, Muying Duan, Adriane Esquivel-Muelbert, Robert Ewers, Belen Fadrique, Sophie Fauset, Ted R. Feldpausch, Filipe França, David Galbraith, Martin Gilpin, Emanuel Gloor, John Grace, Keith Hamer, David Harris, Tommaso Jucker, Michelle Kalamandeen, Bente Klitgaard, Aurora Levesley, Simon L. Lewis, Jeremy Lindsell, Gabriela Lopez-Gonzalez, Jon Lovett, Yadvinder Malhi, Toby Marthews, Emma McIntosh, Karina Melgaço, William Milliken, Edward Mitchard, Peter Moonlight, Sam Moore, Alexandra Morel, Julie Peacock, Kelvin Peh, Colin Pendry, R. Toby Pennington, Luciana de Oliveira Pereira, Carlos Peres, Oliver L. Phillips, Georgia Pickavance, Thomas Pugh, Lan Qie, Terhi Riutta, Katherine Roucoux, Casey Ryan, Tiina Sarkinen, Camila Silva Valeria, Dominick Spracklen, Suzanne Stas, Martin Sullivan, Michael Swaine, Joey Talbot, James Taplin, Geertje van der Heijden, Laura Vedovato, Simon Willcock, Mathew Williams, Luciana Alves, Patricia Alvarez Loayza, Gabriel Arellano, Cheryl Asa, Peter Ashton, Gregory Asner, Terry Brncic, Foster Brown, Robyn Burnham, Connie Clark, James Comiskey, Gabriel Damasco, Stuart Davies, Tony Di Fiore, Terry Erwin, William Farfan-Rios, Jefferson Hall, David Kenfack, Thomas Lovejoy, Roberta Martin, Olga Martha Montiel, John Pipoly, Nigel Pitman, John Poulsen, Richard Primack, Miles Silman, Marc Steininger, Varun Swamy, John Terborgh, Duncan Thomas, Peter Umunay, Maria Uriarte, Emilio Vilanova Torre, Ophelia Wang, Kenneth Young, Gerardo A. Aymard C., Lionel Hernández, Rafael Herrera Fernández, Hirma Ramírez-Angulo, Pedro Salcedo, Elio Sanoja, Julio Serrano, Armando Torres-Lezama, Tinh Cong Le, Trai Trong Le, Hieu Dang Tra
    corecore