Skip to Main content Skip to Navigation
Theses

Proposition d'un système de recherche d'information dans un environnement numérique distribué et hétérogène : application à l'industrie manufacturière

Abstract : The value of information in the manufacturing industry is an important issue. It enables informed decisions to be made and new value-added opportunities to be detected. When it is digitally transcribed, this information is composed of heterogeneous data and distributed in the different silos of the company, making it difficult to have a holistic view of the information. The thesis proposes to access the heterogeneous and distributed information of the company through an information retrieval system. The originality of the proposal consists in considering and modelling all the structured and unstructured data of the company in a single graph. On the other hand, the information retrieval is expressed by a query composed of two variables, the 'what' and the 'about what' and allows to provide as a result a list of documents or records, a list of property values or a list of sentences. The application of the approach to a case study has identified a list of key issues to be addressed in order to improve the usual performance criteria in information retrieval, namely its ability to provide all relevant results (recall) and only relevant results (precision). The four issues to be considered are: (i) the treatment of syntactic specificities of the data, (ii) the semantic extension of the terms used in the search, (iii) the filtering of irrelevant results and (iv) the detection of implicit links between the data. An enrichment of the proposal is then presented to address all these issues, including the transformation of tables in unstructured documents into a graph, a semantic extension of the search terms thanks to a knowledge graph, as well as additional filtering for the evaluation of the relevance of results. Finally, the enriched approach is confronted with a second case study in order to validate the proposal.
Complete list of metadata

https://pastel.archives-ouvertes.fr/tel-03675187
Contributor : ABES STAR :  Contact
Submitted on : Monday, May 23, 2022 - 8:10:35 AM
Last modification on : Friday, August 5, 2022 - 2:54:01 PM

File

kim.pdf
Version validated by the jury (STAR)

Identifiers

  • HAL Id : tel-03675187, version 1

Citation

Lise Kim. Proposition d'un système de recherche d'information dans un environnement numérique distribué et hétérogène : application à l'industrie manufacturière. Génie des procédés. HESAM Université, 2021. Français. ⟨NNT : 2021HESAE051⟩. ⟨tel-03675187⟩

Share

Metrics

Record views

30

Files downloads

6