![]() |
![]() |
|||||||||
|
||||||||||
IndexDownload CTGA-DISF
|
CTGA Database SearchHelp TopicsAs the retrieval of information from the CTGA database is as important as filling data in, we paid considerable attention to provide the users with the option of performing complicated queries to obtain focused results without sacrificing the simplicity. At present, the CTGA database can be queried using any of three available modes of search:
In any of the search modes, the user issues a command that is interpreted in the CTGA server. Keyword processing occurs using the system's specialized mark-up language, Claris Dynamic Markup Language (CDML). Accordingly, results are sent to the user's browser as a standard HTML document with no requirement for any additional software. Query results include the names and corresponding OMIM numbers of genes and genetic disorders described in Arab people. By selecting a record among the results, the user accesses extensive details relating to a specific gene or genetic disorder. Simple Search StrategiesIn this section, the user would evaluate the behavior of simple search queries in the three search modes currently available at the CTGA Database. Simple queries may include single or multiple keywords. Example 1: Searching for information on Leukemia In Basic Search, enter the syntax leukemia in the search box then click on the SEARCH button. The search engine will search for the term leukemia throughout all fields in all records in the database and returns back with a minimum of 16 records sorted by default alphabetically according to disease/gene names (September 2006). 1. Amegacaryocytic Thrombocytopenia, Congenital In Advanced Search, enter the syntax leukemia in the top search box then click on the SEARCH button. The search engine will search for the term leukemia throughout all fields in all records in the database and returns back with a minimum of 16 records (September 2006); these exactly represent the results obtained in basic search mode. Note that in this mode the user may control the way results appear by sorting them according to disease/gene names or by corresponding OMIM Numbers. 1. B-Cell CLL/Lymphoma 2 Example 2: Searching for information on the syndactyly (webbed or conjoined fingers) symptom In Basic Search, enter the syntax syndactyly in the search box and click on the SEARCH button. The search engine will search for the term syndactyly throughout all fields in all records in the database and returns back with a minimum of 17 records (September 2006). 1. Absence Defect of Limbs, Scalp, and Skull In Advanced Search, enter the syntax syndactyly in the search box and click on the SEARCH button. The search engine will search for the term syndactyly throughout all fields in all records in the database and returns back with a minimum of 17 records (September 2006); these exactly represent the results obtained in basic search mode. 1. Absence Defect of Limbs, Scalp, and Skull Example 3: Searching for information on alpha-thalassemia using its OMIM number (141800) In Basic Search, enter the syntax 141800 in the search box then click on the SEARCH button. The search engine will search for the number 141800 throughout all fields in all records in the database and returns back with the record related to alpha-thalassemia. Example 4: Searching for genetic disease records in CTGA by the family name (Al-Gazali) of a corresponding investigator In Basic Search, enter the syntax Al-Gazali in the upper search box then click on the SEARCH button. The search engine will search for the family name Al-Gazali throughout all fields in all records and returns back with a minimum of 69 records (September 2006). Phrase SearchingPhrase searching is used to consider keywords as phrases. Hence, the purpose of this type of search is to look for keywords that occur side by side in a record and in the order given, whereas in simple searching, multiple keywords do not necessarily have to be next to each other. In CTGA searching, using parentheses quotes " " around search words is the way to do phrase searching. Example 5: Searching for CTGA records by the phrase "renal agenesis" In Basic Search, enter the syntax "Renal Agenesis" in the search box (including the "" marks) then click on the SEARCH button. The search engine will search for the phrase throughout all fields in all records in the database and returns back with a minimum of 6 records (September 2006). 1. Abdominal Muscles, Absence of, with Urinary Tract Abnormality and Cryptorchidism In Advanced Search, enter the syntax "Renal Agenesis" in the top search box (including the "" marks) then click on the SEARCH button. The search engine will search for the phrase throughout all fields in all records in the database and returns back with the same set of records obtained by basic search. 1. Abdominal Muscles, Absence of, with Urinary Tract Abnormality and Cryptorchidism Please note that removing the " " marks from the syntax in this example will result in larger numbers of records since the CTGA search engine will not recognize the phrase Renal Agenesis as a single search term, but will search for renal and agenesis as two separate keywords. Example 6: Searching for genetic disease records in CTGA by the full name (Andre Megarbane) of a corresponding investigator In Basic Search, enter the syntax "Megarbane A", in the format of last name plus initials with no punctuation, in the search box (including the " " marks) then click on the SEARCH button. The search engine will search for the name Andre Megarbane throughout all fields in all records in the database and returns back with a minimum of 34 records (September 2006). Example 7: Searching for records citing references from Nature Genetics journal. In Basic Search, enter the name of the journal according to the MEDLINE abbreviation "Nat Genet" in the search box (including the " " marks) then click on the SEARCH button. The search engine will search for the term "Nat Genet" throughout all fields in all records in the database and finds 13 records (September 2006). Wildcard SearchingBy default, the CTGA search engine permits the use of the * wildcard symbol to perform truncation queries. A wildcard symbol is a special character that will match any other(s), including absence of a character. Example 8: Searching for keywords starting with the prefix Hemo- In Basic Search, enter the syntax Hemo (or Hemo*) in the search box (including the * marks) then click on the SEARCH button. The search engine will search for all possible keywords with the root Hemo throughout all fields in all records in the database and finds a minimum of 45 records (September 2006). Terms that are searched in this example include: Hemoglobin, Hemostasis, Hemolysis, Hemochromatosis, Hemophagocytic, Hemorrhagic, Methemoglobin, and many others. Example 9: Searching for keywords ending with the suffix -dactyly In Basic Search, enter the syntax dactyly (or *dactyly) in the search box (including the * marks) then click on the SEARCH button. The search engine will search for all words ending with the suffix dactyly throughout all fields in all records in the CTGA database and returns back with a minimum of 55 records (September 2006). Terms that are searched in this example include: Syndactyly, Brachydactyly, Camptodactyly, Ectrodactyly, and many others. Example 10: Letter substitution to search for the keywords micro- and macrocephaly. In Basic Search, enter the syntax M*crocephaly in the search box then click on the SEARCH button. The search engine will search for all possible combinations of the keyword (microcephaly or macrocephaly) throughout all fields in all records in the CTGA database and returns back with a minimum of 49 records (September 2006). Test this search again by entering either microcephaly or macrocephaly; results will include a minimum of 38 or 13 records, respectively. No difference occurred between results obtained using basic or advanced search in all examples described thus far. The main reason for this outcome is that in both modes the CTGA search engines address a query to all fields of all records in the CTGA database. However, the importance of advanced search querying in CTGA is expressed in its capacity to conduct Boolean searching, as we shall see in the following section. Boolean searchingSearching the CTGA Database can also be formulated according to the principles of the Boolean logic, named for the British-born Irish mathematician George Boole. Boolean searching refers to the logical relationship among search terms. Carrying out a Boolean search is not as frightening as it may sound. You simply apply the common Boolean operators to your search string to construct logical relationships among search terms: search term BOOLEAN OPERATOR search term By default, querying using a string of keywords in basic, advanced, or power search modes results in the implementation of the AND operator, where all keywords should occur in common records irrespective of their sequence. However, the advanced CTGA search engine surpasses the other two search modes by its capacity to process two or more keywords with 'OR' and 'NOT' Boolean connectors. Example 11: Searching for an association between leukemia and neurofibromatosis. In Basic Search, enter the syntax leukemia neurofibromatosis in the search box and then click the SEARCH button. By default, the AND Boolean operator is utilized and the CTGA search engine seeks records where BOTH terms occur together throughout all fields and returns back with a minimum of 1 record (September 2006). 1. Neurofibromatosis, Type I Remember that in example 1, a simple search for the term leukemia resulted in 16 records. Hence, one of these records also included the term neurofibromatosis. Example 12: Searching for leukemia without any association with neurofibromatosis. Since basic and power search do not have the option of applying the NOT operator, in Advanced Search, enter the syntax leukemia in the upper search box, neurofibromatosis in the lower search box, chose the NOT Boolean connector, and then click on the SEARCH button. The search engine will search for all records with the term leukemia excluding all records containing the term neurofibromatosis occurring throughout all fields in all records in the database and return back with a minimum of 15 records (September 2006). 1. Amegacaryocytic Thrombocytopenia, Congenital From the interpretation made in the previous example, the results obtained herein can be easily verified since 16 records had the term leukemia including one in which the term neurofibromatosis also occurred. It is also important to note that the CTGA search engine processes Boolean search in a left-to-right sequence. Hence, the syntax leukemia NOT neurofibromatosis will search for records containing the term leukemia, but excluding all those containing the term neurofibromatosis whereas the syntax neurofibromatosis NOT leukemia will search for records containing the term neurofibromatosis but not the term leukemia. Example 13: Searching for records with either the term leukemia or the term neurofibromatosis. Since basic and power search do not have the option of applying the OR operator, in Advanced Search, enter the syntax leukemia in the upper search box, neurofibromatosis in the lower search box, chose the OR Boolean connector, and then click on the SEARCH button. The search engine will search for all records with either terms leukemia or neurofibromatosis occurring throughout all fields in all records in the database and returns back with a minimum of 18 records (September 2006). 1. Amegacaryocytic Thrombocytopenia, Congenital The interpretation of this result is for the reason that three records in the CTGA database contain the term neurofibromatosis (September 2006) and 16 records contain the term leukemia (including one common record with neurofibromatosis). ![]() Figure 1. Illustration of how the AND, OR, and NOT Boolean search logic works. Boolean-based search is not only limited to simple queries as indicated in examples 11 through 13. Additionally, all examples indicated in simple, phrase, and wildcard search strategies may be clustered in a multitude of compound queries and subject them to Boolean logic. So far, all examples described in this section demonstrated the application of Boolean logic in queries containing a maximum of two terms. Queries with three or more terms are interpreted differently in the Advanced Search mode in CTGA. The examples described in Table 5 best express the behavior of the top and bottom Advanced Search boxes. By default, a string of keywords in either the top or bottom boxes are treated with the AND Boolean operator. The same applies when the NOT operator is utilized. However, the use of the OR operator replaces the default action of the bottom box where multiple keywords are treated with the OR operator among themselves too. ![]() Figure 2. Examples of multiple keyword queries and their results using Boolean logic in the CTGA Advanced Search mode (results as of September 2006). Composite Search StrategiesAll search strategies described thus far included examples in which the user is searching for a keyword, a string of keywords, or phrases occurring in a defined field (power search) or anywhere in all CTGA records (basic and advanced modes). However, in certain cases querying multiple keywords, each occurring in a specific field, is the key to achieve more focused results. Actually, basic and advanced search engines in CTGA map all queried keywords similarly to all fields in all records in the database and, thus, are not very efficient tools in this situation. In the following examples, we shall see the advantage of utilizing the power search mode in CTGA database to query for multiple keywords each mapped to a specific field in CTGA records. Example 14: Searching for all Nervous System Disorders mapped to Chromosome 1 in the Arab Populations. In Power Search, click the button Disease Phenotype, chose “Diseases of the Nervous System” from the drop-down menu, click the Chromosome 1 button, chose the results sorting preference (optional), then click on the SEARCH button. This query results in 7 records (September 2006). 1. Charcot-Marie-Tooth Disease, Axonal, Type 2B1 Example 15: Searching for Autosomal Recessive disorders mapped to Chromosome 4 in the population of the United Arab Emirates. In Power Search, click the button Disease Phenotype, leave the drop-down menu unchecked to search in all disease classifications, click the Chromosome 4, Autosomal, Recessive, and United Arab Emirates checkboxes, chose the results sorting preference (optional), then click on the SEARCH button. This search results in 3 records (September 2006): 1. Autonomic Control, Congenital Failure of Example 16: Searching for Gene Loci mapped to the X Chromosome in the Lebanese population. In Power Search, click the buttons Gene Locus, Chromosome X, and Lebanon. This search will result in 3 records (September 2006): 1. OCRL Gene Please note that in the categories Chromosome, Mode of Inheritance, and Country, it is possible to select multiple checkboxes. However, in the Record Category, search is restricted to one option at a time, either genetic disease (Disease Phenotype) or disease-causing gene (Gene Locus) since CTGA records are classified in either option at a time. Example 17: Searching for all disorders leading to either micro- or macrocephaly and that Prof. Al-Gazali described in the United Arab Emirates. In Power Search, enter the keywords M*crocephaly in the Symptom, Molecular Pathology, Epidemiology search box and Al-Gazali in the Literature Search box. Then click the United Arab Emirates checkbox, chose the results sorting preference (optional), and then click on the SEARCH button. This search results in 11 records (September 2006): 1. Achondroplasia By combining the ease of buttons, checkboxes, and drop-down menu selections with proper term searching in power search boxes mining the CTGA database gains a further dimension. Undoubtedly, the correct use of the CTGA Power Search Engine will help any average end-user to retrieve well-focused answers. |