Supporting Remote User Defined Functions in Heterogeneous Biological Databases

Similar to most scientific studies, biological analyses demand a great deal of computations and simulations involving sophisticated tools that are often found geographically distributed over the Internet. A worldwide effort in genomics research has resulted in a powerful collection of publicly available...

Bibliographic Details
Main Authors: Liangyou Chen, Hasan M. Jamil
Other Authors: The Pennsylvania State University CiteSeerX Archives
Format: Text
Language: English
Published: 2001
Subjects:
DML
Online Access: http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.21.1142
http://www.cs.msstate.edu/~jamil/my-pub-papers/chen-bibe-2001.ps.gz
id ftciteseerx:oai:CiteSeerX.psu:10.1.1.21.1142
record_format openpolar
institution Open Polar
collection Unknown
op_collection_id ftciteseerx
language English
description Similar to most scientific studies, biological analyses demand a great deal of computations and simulations involving sophisticated tools that are often found geographically distributed over the Internet. A worldwide effort in genomics research has resulted in a powerful collection of publicly available sequence analysis tools. These tools often require specialized local services and domain knowledge to function correctly, rendering them unlikely candidates for integration into remote database applications. Thus, integration of heterogeneous "functions" still remains an open problem. Providing a reasonable framework for seamless integration of these tools with database query engines will enable application developers to exploit and harness the power of these effective analysis tools. In this paper, we present an integration framework for such tools by enabling access to them in a user transparent way as part of database queries. In our system, such online tools are abstracted as remote user defined functions (RUDF). An extended SQL DDL language, called the Internet Function Definition Language (IFDL), is presented for the specification and definition of RUDFs. The interface between database system and the Internet is implemented using a layer based on a language called the Hyper Text Query Language (HTQL). The separation of IFDL, DDL, HTQL and SQL DML offers several optimization opportunities and makes it possible to develop an architecture for interoperability of heterogeneous databases with RUDFs in more simple and efficient ways.
author2 The Pennsylvania State University CiteSeerX Archives
format Text
author Liangyou Chen
Hasan M. Jamil
spellingShingle Liangyou Chen
Hasan M. Jamil
Supporting Remote User Defined Functions in Heterogeneous Biological Databases
author_facet Liangyou Chen
Hasan M. Jamil
author_sort Liangyou Chen
title Supporting Remote User Defined Functions in Heterogeneous Biological Databases
title_short Supporting Remote User Defined Functions in Heterogeneous Biological Databases
title_full Supporting Remote User Defined Functions in Heterogeneous Biological Databases
title_fullStr Supporting Remote User Defined Functions in Heterogeneous Biological Databases
title_full_unstemmed Supporting Remote User Defined Functions in Heterogeneous Biological Databases
title_sort supporting remote user defined functions in heterogeneous biological databases
publishDate 2001
url http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.21.1142
http://www.cs.msstate.edu/~jamil/my-pub-papers/chen-bibe-2001.ps.gz
genre DML
genre_facet DML
op_source http://www.cs.msstate.edu/~jamil/my-pub-papers/chen-bibe-2001.ps.gz
op_relation http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.21.1142
http://www.cs.msstate.edu/~jamil/my-pub-papers/chen-bibe-2001.ps.gz
op_rights Metadata may be used without restrictions as long as the oai identifier remains attached to it.
_version_ 1766397667064152064