Literature
Resources for Bioinformatics
compiled by Andrea
Dinkelman, Assistant Professor and Science & Technology Librarian
Iowa State University
adinkelm@iastate.edu
Presentation for the
Computational and Systems Biology Summer Institute (CSBSI)
Friday, June 20, 2008
Last updated 6/19/2008
Goals of the session
1. To
provide a brief introduction to the ISU Library.
2. To provide a brief
orientation to the NCBI web site.
3.
How to use PubMed effectively to search the scientific literature.
4. Briefly mention other
major research literature databases useful in bioinformatics research.
ISU Library Website
- http://www.lib.iastate.edu/
Useful links:
1. Collections ---->
Indexes
& Abstracts - listing of databases available from ISU Library
2. Collections ---->
e-Journals & e-Books
- listing electronic journals available from ISU Library
3. Library
catalog - search for books, journals, CDs, & DVDs owned by ISU
4. Electronic
books - electronic books about bioinformatics, genetics, genomics
Research Questions
1. What is the status
of the genome sequencing project for the tomato?
2. What is the scientific name for
the honeybee?
3. Where can you find information
about the human HTT gene?
4. Your
professor has asked you to retrieve the following articles, and all you have
is a list of numbers: 17384733; 16886994; and 16216716. Can you find these?
5. You've been asked to locate some
review articles about the swine genetics published within the last 3 years.
6. You want to be alerted via email
when new articles are published about the evolution of turtles.
Answers will be revealed using the
NCBI website and PubMed!
I. National Center for Biotechnology
Information (NCBI) - http://www.ncbi.nlm.nih.gov/
- Created in 1988 as a part of the
National Library of Medicine at NIH
What does the NCBI do?
NCBI accepts submissions
of primary data (e.g. GenBank). Original research data submitted by researchers.
NCBI develops tools to analyze these data. (e.g. BLAST)
NCBI uses these tools to create derivative databases based on the primary
data (e.g Refseq).
NCBI provides free searching, linking, and retrieval of these data,
primarily through the Entrez system.
Data Domains http://www.ncbi.nlm.nih.gov/Class/MLACourse/Modules/Databases/datadomains.html
Types of databases by scope http://www.ncbi.nlm.nih.gov/Class/MLACourse/Modules/Databases/databases_scope.html
Types of databases by level of curation
(Archival vs curated) http://www.ncbi.nlm.nih.gov/Class/MLACourse/Modules/Databases/databases_curation.html
Useful links available from the
homepage
II. Entrez
What is Entrez?
- A system of 33 linked databases
- A text search engine
- A tool for finding biologically
linked data
- Provides a quick view of the content
of most NCBI databases.
From the NCBI homepage, click on
"All Databases" on the toolbar ----> Entrez
Cross-Database Search Page
- Example: presenilin 1 (Shows
number of records in each resource.)
Entrez Nucleotide - collection
of sequences from several sources, including GenBank, RefSeq, and PDB.
Example: Search for this record,
L42110 (all sequence records are given accession numbers)
How do I interpret the information
shown in the sequence record?
From the L42110 record, click on
"Links", select "Related Sequences" and look at the second
record, NM_000021. Note the number of references that include publications about
this sequence and the comments field.
RefSeq records have distinct accession
numbers - 2 characters followed by an underscore character ('_').
RefSeq FAQ: http://www.ncbi.nlm.nih.gov/RefSeq/RSfaq.html
III. Genome Biology Resources
What is the status of the genome
sequencing project for the tomato?
Click on "Genomic Biology"
in the left-hand sidebar
- On the right-hand side, click
"Genome Projects Database". On the left-hand sidebar note the following
links: Statistics, Sequencing Centers.
Where can you find information about
the human HTT gene?
Genomic Biology http://www.ncbi.nlm.nih.gov/Genomes/
- Click on "Gene" in the
left-hand side bar. Search for: HTT [sym] and use the Limits tab to limit
to: homo sapiens
- Genetics Home Reference http://ghr.nlm.nih.gov/
(easy-to-understand about genes and genetic conditions)
IV. Taxonomy
What is the scientific name for the
honeybee?
TaxBrowser - link on the NCBI tool
bar
V. PubMed
MEDLINE database
- Produced by the National Library
of Medicine
- Indexes the health sciences and
life sciences literature
PubMed http://www.pubmed.gov/
- Freely available web site - However,
you'll want to access PubMed via the "Indexes
& Abstracts" list on the Library's website. Choose "PubMed
-- ISU Affiliate" link. This has the "Get it@ISU" link on every
record.
- PubMed vs. PubMed
Central - PubMed Central is a free full-text archive of biomedical and
life sciences journal literature.
VI. Using PubMed
- For additional information:
- Search Tips
- Most functionalities accessed
through tabs: Limits; Preview/Index; etc.
- Recommend using Boolean operators,
AND, OR, NOT. Must be in capital letters!
- Sample Search: bees AND
social behavior AND genetics
- Click on author name
for "AbstractPlus" view.
- Publisher-provided
links to full-text.
- Look at "Medline"
display view to look at the complete database record with fields.
Look at "Citation" display view to see the MeSH terms.
- Limits Tab and Field Tags
- Can limit a search by various
parameters, e.g. dates, language, etc.
- Field tags can be very useful.
For example,
- signal transduction AND
science vs.
- signal transduction AND
science [ta]
- Complete list of tags available
in the "Help" document. Search "tags" ->Search
Field Descriptions and Tags
- Affiliation tag [ad]
- PubMed ID [pmid]
- History and Clipboard tabs
- History: provides review of
previous searches
- Clipboard: Allows you to store
a set of results for a short period of time. Can email or save results
to a text file. Click checkbox for records of interest, then use "Send
To" command -> Clipboard. Then you can email, create text, file,
etc, from the clipboard.
- MeSH (Medical Subject Headings)
- controlled vocabulary terms
that are assigned by indexers
- can view MeSH terms if "Citation
View" is selected.
- Other Features
- Journals Database: locate
complete journal titles from abbreviations and vice versa
- Single Citation Matcher: locate
journals with incomplete citations
- My NCBI: can set up searches
to run automatically and have results emailed to you
- PubMed Bookshelf: collection
of full-text searchable textbooks
VII. Additional Resources - all
of the following resources are listed under "Indexes & Abstracts"
- Biosis Previews - available within
the Web of Knowledge
- CAB Abstracts
- Strong in agriculture, plant
science, animal science.
- Covers international publications.
- Coverage: 1910 - present
- Note: The ISU subscription allows
for 8 simultaneous users.
- Web of Science (part of the
Web of Knowledge)
- Provides access to the ISI
Citation Indexes: Science Citation Index, Social Sciences Citation Index,
Arts & Humanities Citation Index
Science Citation Index: 6,376 covered journals
- Publications are selected
for inclusion in Web of Science based on the following criteria.
- For more information, visit:
http://scientific.thomson.com/knowtrend/essays/selectionofmaterial/journalselection/
(Note: Newer journals may/may not be included, e.g. Functional and Integrative
Genomics, which has been published by Springer since 2000 is not a covered
journal.)
- ISU subscription goes back
to 1945 for the Science Citation Index.
- Can search by topic or do
cited reference searching
- Computer Engineering Resources
- Computer and Information Systems
Abstracts - provides broad coverage of journal articles and conference
papers dealing with computer and network technology and their applications,
as well as developments in theoretical computer science. Coverage is from
1981 - present.
- Compendex - includes engineering-related
journals, conference proceedings, and technical reports. Coverage is from
1884 - present.
- IEEE Xplore - provides full-text
online access to publications of the Institute of Electrical and Electronics
Engineers (IEEE) and the Institution of Electrical Engineers (IEE). Contains
full-text proceedings from IEEE Symposium on Computational Intelligence
in Bioinformatics and Computational Biology; IEEE Symposium on Bioinformatics
and Bioengineering as well as others. Most IEEE publications should be
indexed in Compendex.
- ACM Digital Library - publications
of the Association for Computing Machinery
- Google Scholar
- a subset of the Google database
- will retrieve citations to
book chapters, journal articles, conference proceedings, meeting reports
- is NOT as up-to-date as any
of the above mentioned resources
Questions?
Feel free to email me at: adinkelm@iastate.edu