Passage Retrieval in Log Files: An Approach Based on Query Enrichment

International audience. Question answering systems are considered the next generation of search engines. This paper focuses on the first step of this process, which is to search for relevant passages containing answers. Passage retrieval can be difficult because of the complexity of data, log fi...

Bibliographic Details
Main Authors: Saneifar, Hassan; Bonniol, Stéphane; Laurent, Anne; Poncelet, Pascal; Roche, Mathieu
Other Authors: Laboratoire d'Informatique de Robotique et de Microélectronique de Montpellier (LIRMM), Université de Montpellier (UM)-Centre National de la Recherche Scientifique (CNRS), Satin IP Technologies, Université Montpellier 2 - Sciences et Techniques (UM2), Fouille de données environnementales (TATOO), Université de Montpellier (UM)-Centre National de la Recherche Scientifique (CNRS)-Université de Montpellier (UM)-Centre National de la Recherche Scientifique (CNRS), Exploration et exploitation de données textuelles (TEXTE), Territoires, Environnement, Télédétection et Information Spatiale (UMR TETIS), Centre de Coopération Internationale en Recherche Agronomique pour le Développement (Cirad)-AgroParisTech-Centre national du machinisme agricole, du génie rural, des eaux et forêts (CEMAGREF)
Format: Conference Object
Language: English
Published: HAL CCSD 2010
Subjects:
Online Access: https://hal-lirmm.ccsd.cnrs.fr/lirmm-00816291
https://hal-lirmm.ccsd.cnrs.fr/lirmm-00816291/document
https://hal-lirmm.ccsd.cnrs.fr/lirmm-00816291/file/IceTAL2010.pdf
https://doi.org/10.1007/978-3-642-14770-8_39
id ftlirmm:oai:HAL:lirmm-00816291v1
record_format openpolar
spelling ftlirmm:oai:HAL:lirmm-00816291v1 2023-11-05T03:42:54+01:00 Passage Retrieval in Log Files: An Approach Based on Query Enrichment Saneifar, Hassan Bonniol, Stéphane Laurent, Anne Poncelet, Pascal Roche, Mathieu Laboratoire d'Informatique de Robotique et de Microélectronique de Montpellier (LIRMM) Université de Montpellier (UM)-Centre National de la Recherche Scientifique (CNRS) Satin IP Technologies Université Montpellier 2 - Sciences et Techniques (UM2) Fouille de données environnementales (TATOO) Université de Montpellier (UM)-Centre National de la Recherche Scientifique (CNRS)-Université de Montpellier (UM)-Centre National de la Recherche Scientifique (CNRS) Exploration et exploitation de données textuelles (TEXTE) Territoires, Environnement, Télédétection et Information Spatiale (UMR TETIS) Centre de Coopération Internationale en Recherche Agronomique pour le Développement (Cirad)-AgroParisTech-Centre national du machinisme agricole, du génie rural, des eaux et forêts (CEMAGREF) Reykjavík, Iceland 2010-08-16 https://hal-lirmm.ccsd.cnrs.fr/lirmm-00816291 https://hal-lirmm.ccsd.cnrs.fr/lirmm-00816291/document https://hal-lirmm.ccsd.cnrs.fr/lirmm-00816291/file/IceTAL2010.pdf https://doi.org/10.1007/978-3-642-14770-8_39 en eng HAL CCSD Springer-Verlag info:eu-repo/semantics/altIdentifier/doi/10.1007/978-3-642-14770-8_39 lirmm-00816291 https://hal-lirmm.ccsd.cnrs.fr/lirmm-00816291 https://hal-lirmm.ccsd.cnrs.fr/lirmm-00816291/document https://hal-lirmm.ccsd.cnrs.fr/lirmm-00816291/file/IceTAL2010.pdf doi:10.1007/978-3-642-14770-8_39 info:eu-repo/semantics/OpenAccess 7th International Conference on Natural Language Processing, IceTAL NLP: Natural Language Processing https://hal-lirmm.ccsd.cnrs.fr/lirmm-00816291 NLP: Natural Language Processing, Aug 2010, Reykjavík, Iceland. 
pp.357-368, ⟨10.1007/978-3-642-14770-8_39⟩ [INFO.INFO-DB]Computer Science [cs]/Databases [cs.DB] [INFO.INFO-IR]Computer Science [cs]/Information Retrieval [cs.IR] info:eu-repo/semantics/conferenceObject Conference papers 2010 ftlirmm https://doi.org/10.1007/978-3-642-14770-8_39 2023-10-10T22:38:12Z International audience. Question answering systems are considered the next generation of search engines. This paper focuses on the first step of this process, which is to search for relevant passages containing answers. Passage retrieval can be difficult because of the complexity of data, log files in our case. Our contribution is based on the enrichment of queries by using a learning method and a novel term weighting function. This original term weighting function, used within the enrichment process, aims to assign a weight to terms according to their relatedness to the context of answers. Experiments conducted on real data show that our protocol of primitive query enrichment makes it possible to retrieve relevant passages. Conference Object Iceland LIRMM: HAL (Laboratoire d’Informatique, de Robotique et de Microélectronique de Montpellier) 357 368
institution Open Polar
collection LIRMM: HAL (Laboratoire d’Informatique, de Robotique et de Microélectronique de Montpellier)
op_collection_id ftlirmm
language English
topic [INFO.INFO-DB]Computer Science [cs]/Databases [cs.DB]
[INFO.INFO-IR]Computer Science [cs]/Information Retrieval [cs.IR]
spellingShingle [INFO.INFO-DB]Computer Science [cs]/Databases [cs.DB]
[INFO.INFO-IR]Computer Science [cs]/Information Retrieval [cs.IR]
Saneifar, Hassan
Bonniol, Stéphane
Laurent, Anne
Poncelet, Pascal
Roche, Mathieu
Passage Retrieval in Log Files: An Approach Based on Query Enrichment
topic_facet [INFO.INFO-DB]Computer Science [cs]/Databases [cs.DB]
[INFO.INFO-IR]Computer Science [cs]/Information Retrieval [cs.IR]
description International audience. Question answering systems are considered the next generation of search engines. This paper focuses on the first step of this process, which is to search for relevant passages containing answers. Passage retrieval can be difficult because of the complexity of data, log files in our case. Our contribution is based on the enrichment of queries by using a learning method and a novel term weighting function. This original term weighting function, used within the enrichment process, aims to assign a weight to terms according to their relatedness to the context of answers. Experiments conducted on real data show that our protocol of primitive query enrichment makes it possible to retrieve relevant passages.
author2 Laboratoire d'Informatique de Robotique et de Microélectronique de Montpellier (LIRMM)
Université de Montpellier (UM)-Centre National de la Recherche Scientifique (CNRS)
Satin IP Technologies
Université Montpellier 2 - Sciences et Techniques (UM2)
Fouille de données environnementales (TATOO)
Université de Montpellier (UM)-Centre National de la Recherche Scientifique (CNRS)-Université de Montpellier (UM)-Centre National de la Recherche Scientifique (CNRS)
Exploration et exploitation de données textuelles (TEXTE)
Territoires, Environnement, Télédétection et Information Spatiale (UMR TETIS)
Centre de Coopération Internationale en Recherche Agronomique pour le Développement (Cirad)-AgroParisTech-Centre national du machinisme agricole, du génie rural, des eaux et forêts (CEMAGREF)
format Conference Object
author Saneifar, Hassan
Bonniol, Stéphane
Laurent, Anne
Poncelet, Pascal
Roche, Mathieu
author_facet Saneifar, Hassan
Bonniol, Stéphane
Laurent, Anne
Poncelet, Pascal
Roche, Mathieu
author_sort Saneifar, Hassan
title Passage Retrieval in Log Files: An Approach Based on Query Enrichment
title_short Passage Retrieval in Log Files: An Approach Based on Query Enrichment
title_full Passage Retrieval in Log Files: An Approach Based on Query Enrichment
title_fullStr Passage Retrieval in Log Files: An Approach Based on Query Enrichment
title_full_unstemmed Passage Retrieval in Log Files: An Approach Based on Query Enrichment
title_sort passage retrieval in log files: an approach based on query enrichment
publisher HAL CCSD
publishDate 2010
url https://hal-lirmm.ccsd.cnrs.fr/lirmm-00816291
https://hal-lirmm.ccsd.cnrs.fr/lirmm-00816291/document
https://hal-lirmm.ccsd.cnrs.fr/lirmm-00816291/file/IceTAL2010.pdf
https://doi.org/10.1007/978-3-642-14770-8_39
op_coverage Reykjavík, Iceland
genre Iceland
genre_facet Iceland
op_source 7th International Conference on Natural Language Processing, IceTAL
NLP: Natural Language Processing
https://hal-lirmm.ccsd.cnrs.fr/lirmm-00816291
NLP: Natural Language Processing, Aug 2010, Reykjavík, Iceland. pp.357-368, ⟨10.1007/978-3-642-14770-8_39⟩
op_relation info:eu-repo/semantics/altIdentifier/doi/10.1007/978-3-642-14770-8_39
lirmm-00816291
https://hal-lirmm.ccsd.cnrs.fr/lirmm-00816291
https://hal-lirmm.ccsd.cnrs.fr/lirmm-00816291/document
https://hal-lirmm.ccsd.cnrs.fr/lirmm-00816291/file/IceTAL2010.pdf
doi:10.1007/978-3-642-14770-8_39
op_rights info:eu-repo/semantics/OpenAccess
op_doi https://doi.org/10.1007/978-3-642-14770-8_39
container_start_page 357
op_container_end_page 368
_version_ 1781700484892983296