|
Assignment for Friday's class:
Assignment for Wednesday's class (next week):
|
Student questions
Bioinformatics Minor
Is it Bioinformatics or not?

| NCBI (National Center for Biotechnology Information) is a home for many public biological databases (see diagram below). All of the databases are interlinked, and they all have common search and retrieval system - Entrez. ![]() Another representation of the connections between the different databases in ENTRZ is here. For the interactive Pubmed tutorial click here. An Entrez tutorial (non interactive) is here (both go well beyond what you need to know for Friday). Use Boolean operators (AND, OR, NOT) to perform advanced searches. Here is an excellent explanation of the Boolean operators from the Library of Congress Help Page. Search Field Tags- Listed here. Explore features of Entrez interface: Limits, Index, History, Clipboard and MyNCBI. |
Other Useful Databases and Services:While Medline is incorporating more and more non-medical literature, there might still be gaps in the coverage. Alternatives are other databanks available though the National Library of Medicine (here) and the local services offered at the UConn libraries. Especially Current Contents and Agricola nicely complement PubMed. The best way to access them is the use of "SilverPlatter" database. Also, the "Web of Science" database gives access to the Science Citation Index: a database that tracks cited references in journals. Note that many resources are restricted to the UConn domain, thus you either need to access them from a campus computer or through the proxy account. In some instances you are prompted to connect to the UConn VPN network. Want to be informed about new sequences/articles in your research area? Check out these services:
|
In short, these services allow a user to define queries that are stored in the user's profile. Using these queries the searches are regularly performed against the updates to the databases, and then the user is i nformed (by email alert) if there is anything new that match the queries. A great way to save time! Other web pages: Nucleic Acid Research Database Issue http://www.ebi.ac.uk/ http://rdp.cme.msu.edu/ http://www.jgi.doe.gov/ http://www.genomesonline.org/ http://www.tigr.org http://genome-www.stanford.edu/ http://www.flybase.org/ http://www.arabidopsis.org/ http://www.ensembl.org/ |
| Sequence and structure databanks can be divided into many different categories. One of the most important is |
|
|
|
Old Powerpoint slides on data banks are here
In case you want more information on databanks, a computer sciences oriented intro to biological databanks, their design and management is at
https://gcrcweba.lacusc.co.la.ca.us/bifresources/BuildingUsingBiolDB.pdf
Discussion: What seperates living from dead?
|
What does Bioinformatics have to do with Molecular Evolution?
The following chain although (believed to be) mainly determined by the DNA sequence (plus other components of the cell which in turn are encoded by other parts of the genome) can at present not be simulated in a computer. DNA sequence -> ... . Most scientists believe that the principle of reductionism (plus new laws and relations emerging on each level) is true for this chain; however, this is clearly "in principle" only. At several steps along the way from DNA to function our understanding of the chemical and physical processes involved is so incomplete that prediction of protein function based on only a single DNA sequence is at present impossible (at least for a protein of reasonable size). Solution:
Present day proteins evolved through substitution and selection from ancestral proteins. Related proteins have similar sequence AND similar structure AND similar function. In the above mantra "similar function" can refer to:
The following is based on observation and not on an a priori truth:
THE REVERSE IS NOT TRUE:
In particular, PROTEINS WITH SHARED ANCESTRY DO NOT ALWAYS SHOW SIGNIFICANT SIMILARITY |
|||||||