Relational Databases for Querying Natural Language Text

dc.contributor.author: Rafiei, Davood
dc.contributor.author: Chubak, Pirooz
dc.date.accessioned: 2025-05-01T21:13:46Z
dc.date.available: 2025-05-01T21:13:46Z
dc.date.issued: 2007
dc.description: Technical report TR07-08. With the vast amount of information stored in natural language text, sophisticated query engines are needed to pull data and effectively relate the pieces. While there has been a great deal of activity around semistructured data and in particular XML, there has not been much work on querying natural language text, despite the regularities that exist in natural language text. This paper explores a more conservative approach where natural language text is stored in a relational database. We present a framework for querying and integrating natural language text with relational data and investigate different strategies for optimizing queries. Our results show that the size of the plan space depends on the number of query terms and the overlap between query rewritings. Moreover, we show that the complexity of finding an optimal plan in the presence of rewritings is NP-hard. We develop a cost model and pruning techniques to reduce the size of the search space, and a polynomial-time greedy algorithm that finds a sub-optimal plan over a set of rewritings. Our experimental results indicate great savings in the evaluation costs of the optimized queries and that our greedy algorithm finds either an optimal plan or a plan that is very close to optimal in terms of cost. | TRID-ID TR07-08
dc.identifier.doi: https://doi.org/10.7939/R34F1MH80
dc.language.iso: en
dc.rights.uri: http://creativecommons.org/licenses/by/3.0/
dc.subject: Natural language queries
dc.subject: Relational databases
dc.title: Relational Databases for Querying Natural Language Text
dc.type: http://purl.org/coar/resource_type/c_93fc
ual.jupiterAccess: http://terms.library.ualberta.ca/public

Files

Original bundle

Name: TR07-08.pdf
Size: 2.57 MB
Format: Adobe Portable Document Format