
    Automatic vs. Manual categorisation of documents in Spanish

    This paper analyses the automatic classification of documents in Spain. Automatic categorisation can be understood as a learning process during which a programme recognises the characteristics that distinguish each category or class from the others, i.e. the characteristics a document must have in order to belong to that category. As yet, few experiments have been carried out with documents in Spanish. Here we show how pattern vectors that capture the characteristics of different classes or categories of documents can be built using techniques based on those applied to query expansion by relevance feedback; we also report the results of applying these techniques to a collection of documents in Spanish. The same collection was classified manually, and the results of the two procedures were compared.
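The idea of building one pattern vector per category with relevance-feedback techniques can be illustrated with a small sketch. This is not the authors' implementation: it assumes a Rocchio-style formula (mean term frequency inside the class minus a fraction of the mean outside it), whitespace tokenisation, and cosine similarity for assignment. The function names, the `beta` and `gamma` weights, and the toy Spanish sentences are all hypothetical and chosen only for illustration.

```python
from collections import Counter
import math


def term_vector(text):
    """Lowercased whitespace tokens mapped to raw term frequencies."""
    return Counter(text.lower().split())


def build_pattern_vectors(labelled_docs, beta=1.0, gamma=0.25):
    """Build one pattern vector per category, Rocchio-style (an assumption).

    labelled_docs: list of (label, text) pairs.
    Each term weight is beta * mean tf inside the class
    minus gamma * mean tf outside it; non-positive weights are dropped.
    """
    vectors = [(label, term_vector(text)) for label, text in labelled_docs]
    labels = {label for label, _ in vectors}
    patterns = {}
    for label in labels:
        pos = [v for l, v in vectors if l == label]
        neg = [v for l, v in vectors if l != label]
        pattern = Counter()
        for v in pos:
            for term, tf in v.items():
                pattern[term] += beta * tf / len(pos)
        for v in neg:
            for term, tf in v.items():
                pattern[term] -= gamma * tf / len(neg)
        patterns[label] = {t: w for t, w in pattern.items() if w > 0}
    return patterns


def cosine(a, b):
    """Cosine similarity between two sparse term-weight mappings."""
    dot = sum(w * b.get(t, 0.0) for t, w in a.items())
    norm_a = math.sqrt(sum(w * w for w in a.values()))
    norm_b = math.sqrt(sum(w * w for w in b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


def classify(text, patterns):
    """Assign the category whose pattern vector is most similar to the text."""
    vec = term_vector(text)
    return max(patterns, key=lambda label: cosine(vec, patterns[label]))


# Toy training data, invented for illustration only.
train = [
    ("deporte", "el equipo gana el partido de fútbol"),
    ("deporte", "el jugador marca un gol en el partido"),
    ("economía", "el banco sube los tipos de interés"),
    ("economía", "la bolsa cae y el banco pierde"),
]
patterns = build_pattern_vectors(train)
print(classify("el partido de fútbol termina con un gol", patterns))
```

Comparing such automatic assignments with labels given by human indexers, document by document, is one simple way to set the two procedures side by side. A realistic system would also need stop-word removal and stemming suited to Spanish, which this sketch omits.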