I nessi semantici e sintattici e la loro rilevanza nella costruzione di 'thesauri'. Un esperimento

Giannoni, Roberto; Novaresio, Paolo

Informatica e diritto, Year IV, Vol. IV, 1978, no. 1, pp. 20-44

Semantic and Syntactic Relations and Their Importance in the Construction of Thesauri. An Experiment

Two distinct lines of research can be identified in the field of Information Storage and Retrieval (ISR). One concentrates on the statistical aspects of language, treating statistics as an operationally simple method adequate for handling both the semantic ambiguities and the syntactic problems of ordinary language. The other attempts to build artificial grammars on which to base a one-to-one, universally valid correspondence between signs and the concepts they designate. According to the Authors, however, following either orientation always ends up revealing ineliminable elements of mutual interference or integration, so it seems right to pursue a line of inquiry that grafts the probabilistic phenomena of language onto a semantic-syntactic study. Starting from the premise that ISR is straightforward within specialized languages but becomes arduous and hazardous in fields such as law, where technical terms are a minority compared with the many words drawn from the most varied vocabularies of everyday life (commerce, building, medicine, etc.), the Authors then describe the characteristics and phases of an experiment they conducted on two distinct data sets: the titles and subtitles of the articles that appeared in one year of the periodical "Data Report", and a number of regesta prefixed to the edition of the Registrum Vetus of the Commune of Sarzana.
