This book has many references and questions that require you to use the world wide web (WWW). The WWW was designed to use uniform standards so everyone would have equal access to all information. Unfortunately, a number of companies developed proprietary versions that do not work on all computers. In addition, new technology has been added to the original list of ways to deliver information. Therefore, this web appendix provides you with the information you need to get your computer up to speed for all the WWW sites used in Discovering Genomics, Proteomics and Bioinformatics.
Terminology
There are some basic terms that you need to know to use the links in this book.
A more complete list can be found at this location <http://www-personal.umich.edu/~zoe/Glossary.html>.
Browser - a software program that allows you to visually surf the WWW. There are two main browsers: Netscape and Internet Explorer.
URL - Uniform Resource Locator which is a technical way of saying the web address. It is the string of letters, slashes and numbers that allow your browser to see the appropriate page. With newer browsers, you no longer have to type in the "www" portion of the address, though it never hurts to add it.
Frames - When a web page is divided into sections, each section is called a frame. If you visit <http://bio.davidson.edu/courses/Molbio/websearch/SearchingNCBI.html>, you will see two frames of equal size, though the left frame has a lot of text and the right frame says "Web pages from NCBI will appear here....."
Download - to retrieve software or other computer files from a remote location to your computer. For example, you will need to download some software to see certain web page.
Java - this is a programming language that sends a small program over the WWW to run locally on your computer. Java programs are called "applets" meaning small applications or programs. Java is the major area where standards are not respected. Macintosh computers that run operating system 10 (called OS X) are reported to support all forms of Java, both the original standard and the Microsoft derivations. Mac OS 9.1 and earlier only supports the original Java standards. PCs running Microsoft windows operating systems will be able to support both the original and the Microsoft derived forms of Java.
Plug-ins - these are free software add-ons that you can download to update your browser. Plug-ins are needed for some of the newer media for delivering sound and movies.
Browsers
and Platforms
There are
two major computer platforms in
the biology world - Macintosh and PC (which stands for personal computer, also
known as IBM-compatible or Microsoft products). A new comer to the field is
Linux, and its popularity is growing among those who like to tinker and hack
with computers. Since most Linux users also run another platform and because
many plug-ins are not available for Linux, only Macintosh and PC platforms will
be addressed here.
Macintosh
Users
Macintosh
is still popular with many biologists but due to the power of Microsoft, it
has some problems interpreting pages created with Microsoft standards. Furthermore,
Mac users are a small minority of world users and so browser developers do not
always test their products on Macintosh computers. Because of these two reasons,
Mac users will probably want to download both Netscape and Internet Explorer
(often abbreviated IE).
One final note about Macintoshes. As with all computers, the number of programs you can run simultaneously is determined by the amount of RAM you have bought. With Macs, you can set the RAM for each program individually so that you can have more than one open at a time even with very little RAM. However, the down side to this approach is that some big files may not open and you may get an error message. If this happens, you can fix the problem by finding the application that is RAM-limited and click on it once to highlight it but not launch it (you must quit the application if it is already running). Once the application is highlighted, hold down the Apple key and type the letter I while still holding down the apple key. This will bring up a window as shown in figure 1-1.
Figure 1. Screen shot showing how to adjust memory on a Macintosh.
Click in the box next to show: and choose memory. From this window, you can increase the allocation of RAM for any application. In this example (figure 1), the RAM is set for 40,312 K (or 40.312 megabytes). This is much higher than the default setting of 8192 and allows larger files to be viewed.
Figure 2. Screen shot showing finally settings for memory allocation.
Netscape
To download the latest
version of Netscape, go to the Netscape download home page <http://home.netscape.com/products/index.html?cp=brinavbrincs>.
As of this writing, Netscape version 6.1 is still in early form (called beta).
Since there might be some bugs (problems) with this version, this book will
assume you are using 4.x which means the most recent version of Netscape 4.
The current version is called 4.78. Download this by clicking on the link that
says "Netscape Browsers" and follow the directions. By the time the
book is published, version 6.x may be the only
option available. If so, download Netscape 6.x.
Another advantage for Netscape is the built in composer function. Netscape Composer is a free web authoring program which allows you to create your own web pages. Unless you have access to another product, you can use Composer free of charge for your web pages.
IE
The only
advantage IE has over Netscape on a Mac is Java applications. Due to Microsoft's
position in the market, it can set its own standards and expect a majority of
the world to conform. This means that only the Microsoft browser IE 5.x and
later will work with Microsoft Java applets. The newer versions of Microsoft
Java (1.1 and 1.2) may only work on Macintoshes that run with OS X. This means
in a few more years, this Java v. MS-Java conflict will fade into the distant
past.
Due to an agreement between Apple and Microsoft, IE is preloaded on newer machines. If you cannot find it, download IE by going to the Microsoft web page for browsers <http://www.microsoft.com/windows/ie/default.htm>. Click on the download button and follow directions. The current version is 5.5 and soon version 6.x will become the new standard. Download which ever is available.
PC
Users
For PC users,
Windows comes with IE built in. IE performs most functions properly. The only
exception may be chime which will be discussed below. If you need to update
your version of IE, you can go to the download page at this URL <http://www.microsoft.com/windows/ie/default.htm>.
Click on the appropriate button and follow the directions. Netscape does work
on PC's as well and you can obtain a copy from the Netscape Home Page <http://home.netscape.com/computing/download/index.html?cp=hop05hb2>.
Searching
a Web Page for a Particular Term
Here is
a simple problem with a simple solution. Have you ever searched a web page for
a particular word and had trouble finding the word after viewing the right web
site? To find the word, you can simply use the "Find" function of
your web browser and it will find the word for you. This is especially helpful
on web pages that have a lot of text.
Sample
using Find function
Go to this
URL at Cold Spring Harbor <http://www.nobel.se/chemistry/laureates/index.html>.
Up at the very top of your window, click on the "Edit" menu and choose
"Find". When a dialog box appears, type in the word "Mullis"
and hit return. You will see the word highlighted on the page. This is an easy
way to find the content you are looking for rather than having to scroll down
long pages.
Optimizing Your Browser
There are a few web sites that stand
out as places to start. We will visit a few of them here with other sites listed
at the end of this chapter.
Entrez
PubMed - http://www.ncbi.nlm.nih.gov/PubMed/
The first
place to start any project is the previously published literature. Go to the
Entrez PubMed web site to search the biomedical literature. This is run by the
National Center for Biotechnology Information <www.ncbi.nlm.nih.gov>
which is a part of the National Library of Medicine (NLM) and the National Institutes
of Health (NIH).
To access this huge database, type in any word related to biology. You will get a results page that lists all the publications that contain your word or words. The more words you use, the more specific a response you will get. If you click on the top line that has the authors names in blue, you will usually see an abstract for that publication. Occasionally there will be a large box that is a hyperlink which will take you to an online version of the original paper. The publication of science papers is experiencing a revolution of sorts and some journals allow free access to their articles immediately, others have a delay of 6 - 12 months, some never permit free access. When in doubt, click and find out.
From the Entrez page for PubMed, you can also search many other databases, In the upper left corner, there is a box that allows you to select other databases (figure 1-3). For example, you can choose to search the literature (PubMed), protein sequences, nucleotide sequences, 3D structures, whole and partial genomes, population sequence sets, OMIM which is a catalog of human health information, taxonomic definitions, and domains which are sequences that are conserved and have well characterized functions. This is the ultimate in one-stop shopping for genomic information. We will use this a lot.
Figure 3. Screen shot of searchable databases using NCBI's Entrez web site.
Sample
Search of NCBI
Let's try
out a simple search to find a particular nucleotide sequence. Change the search
to "Nucleotide", enter the word "clock" and hit the "GO"
button. You should get a long list of hits that will cover multiple pages. Now
enter the words "fly clock"
. This should give you a very short
list. Find the one for Drosophila and
click on the accession number which is a hyperlink. You will see all the information
about this particular gene, including the protein and DNA sequences. Now change
the search to "clock Drosophila". You should get over 100 hits simply
by changing from fly to Drosophila.
Perform one last search by entering "period and Drosophila melanogaster".
You will still get many hits, even for species that are not flies because they
have descriptions that use the words you searched. Scroll down your list until
you find a sequence that says:
______________________________________________________________________________
AF251241 Protein, Related Sequences, Popset, Taxonomy
Drosophila melanogaster period (per) gene, partial cds
gi|12005687|gb|AF251241.1|AF251241[12005687]
______________________________________________________________________________
The first line has the accession number (AF251241). Below the accession number is line that describes what this hit is. The phrase "partial cds" means this is a partial coding sequence and thus in not complete. On the third line is a list of symbols that tell you a series of other accession numbers that are used in different databases for this particular sequence. On the far left side on the top line are some terms that are also hyperlinks. Click on the phrase "RelatedSequences" and you should get a short list that includes the full length sequence to the gene called period, or per for short.
Google.com
- www.google.com
If you need
to find almost any web page, the best search engine (program that finds URLs
and catalogs all relevant key words) is Google. Go to the Google web site and
you will see a small box. You may type in as many words as you want (within
reason). The more words you enter, the more specific your search will be and
Google assumes you want to find pages that include all of these terms, not one
or the other. If you know exactly what you are looking for, this is a good approach.
If you are just hunting vaguely, start with fewer terms and then add more as
you get a sense of what you are looking for.
Sample
Search
Enter the phrase
DNA microarray and very quickly you will get over 20,000 hits. You can modify
your search and add the term "undergraduate" and see that the list
has been reduced about 20 fold. You could use Google to help you find a good
summer research job.
Protein
DataBase www.rcsb.org/pdb/index.html
This protein
database (PDB) contains all computer
files that can show us the three dimensional (3D) shapes of proteins. There
are several ways to view these structures, but the easiest is to have the free
plug-in called "Chime"
which is produced by MDL (Molecular Design Limited) <http://www.mdlchime.com/chime/>.
You will have to register to get your free copy of the plug-in. Once you have
logged in, you can follow the links to the download page. It works on both Mac
and PC so choose the appropriate one. Once you have installed it, you will need
to restart your browser so the new plug-in can become activated.
Now that you have downloaded the chime plug-in, you are ready to see 3D structures that have file names ending in ".pdb". If you know the PDB file name, you can enter it in the box. If you do not know the PDB ID number, you can use words to search the database (figure 1-4). Using the PDB ID, enter 1AI3, select the "query by PDB id only" box, and click on the "Find a Structure" button. You will see a page that describes isocitrate dehydrogenase (IDH).
Figure 4. Screen shot from PDB web site.
A new browser window will appear. In this window, you will see the amino acid sequence for IDH in the top frame and the structure in the bottom right frame. Don't rotate the protein yet, leave it in its original position. Click on the button at the left, half way down, that says "Secondary Structure". You will see that the amino acids that make up alpha helices are highlighted in red, beta pleated sheets in blue, and bends in yellow. This has occurred in the amino acid sequence as well as the structure.
If you place your mouse over any amino acid in the structure diagram, you will see its has been identified in the black window on the top left side, just under the full sequence. This also happens when you mouse over amino acids in the sequence.
Change from "Secondary Structure" to "Exposure". You will see that amino acids on the surface of the protein are highlighted differently from the rest of the protein. Note the color of the first two amino acids (ME) in the sequence at the top. Using the mouse, find the first amino acid of the protein structure; it is located at the bottom center of the structure frame. Which amino acid is first in the structure? What happened to the first two amino acids?
Finally, click on the reset button at the bottom on the left side. Change the color to yellow. Now use your mouse to find the amino acid sequence YICLRPVRYYQ which begins at amino acid 125 and ends at number 135. Click and drag to highlight these 11 amino acids and notice that this portion of the structure has also been highlighted yellow.
Close the Quick PDB window and you should still have the original page for viewing IDH. Click on "First Glance" and an animated version of IDH should appear. You can choose to turn on and off the different options by clicking on the appropriate boxes.
Now go back one page and click on the "Protein Explorer" button. Next, make sure your window is properly sized and then click on the button to view 1AI3 from the PDB server. Although it takes a while to load, do not do anything until you see a spinning model of IDH. In the upper right frame, you will see a link that says "Explore 1AI3". Click on this and wait until you see a green box that says ready appear below the structure of IDH. A new set of buttons will appear in the top right frame. Click once on the one that says "water" and most of the red balls will turn to spheres of dots. Click again and they disappear. Click on the other buttons to see what happens.
Finally, there are a number of people who have collected some wonderful tutorials on particular molecules. If you want to visit some, try these out to see what can be done with chime scripting.
Other PDB Sites
Protein Explorer- http://www.proteinexplorer.org/
This site is maintained by Eric Martz at the University of Massachusetts who has pushed Chime scripting further than anyone else. Martz has tutorials on using Protein Explorer, How to create chime scripts, and has many tutorials for your edification.Online Molecular Museum - www.clunet.edu/BioDev/omm/gallery.htm
This site is maintained by David Marcey at California Lutheran University. Marcey and his students have created some outstanding tutorials. Click on the link at the bottom of the left side that says "the exhibits".Nucleic Acid Database atlas - http://ndbserver.Rutgers.edu/NDB/ndb.html
This database contains DNA, RNA, protein-nucleic acid structures. This may be useful if you want to look at non-protein structures.
QuickTime
(QT) - http://www.apple.com/quicktime/download/
QuickTime
is a free plug-in that allows you to see movie files. The 15 second biographies
that are a part of the online resources for this textbook utilize the QT plug-in.
The latest version of QT is 5.x and can be downloaded for Macintosh and PC computers
from the Apple web site listed above. Provide the information, choose your platform,
and download.
Sample
Movie
To make
sure your QT is working, you can check out a 15 second biography <http://bio.davidson.edu/courses/genomics/15secbios/15secbios.html>.
Choose your favorite topic and then select a biography to see and hear. You
can stop the movie by using the control buttons.
If the movies do not play properly, then you will need to check your preferences. To do this, choose preferences under the edit menu. Select "Applications" from the list of preferences. You will get a new dialog box; scroll down until you see "MPEG media file" or similar description. Select this line by clicking on it once and then click on the edit button. Make sure the button next to "Plug-in" has been selected and then make sure the most recent version of QuickTime (5.0.2 or greater) appears in the pop-up menu. If it does not, then you will need to select it by searching through your hard drive and locating QuickTime.
Flash
Animations - http://www.macromedia.com/downloads/
Flash is
the software that creates animations for the WWW, TV, movies. It is a very powerful
program that is sold by Macromedia. The plug-in is free and you can download
it from the site above. You will want to choose the option that says "Macromedia
Shockwave Player". Click on this link and follow the directions. It works
for PC and Mac, Netscape and IE.
Sample
Animation
There are
many good educational animations that use Flash. Some are included with this
book. Try out this one that describes how immunoprecipitations are performed.
This is used for one case study in Chapter 2 <http://bio.davidson.edu/courses/genomics/IMPfolder/IMP.html>.
This animation includes sounds so if you are viewing this where it is
OK to turn up the sound, do so now or use headphones. If you are in a library,
you might want to click on the link at the bottom left that will take you to
a silent version.
Adobe
Acrobat Reader - http://www.adobe.com/products/acrobat/readstep.html
Adobe is a software company
that makes a program called Acrobat. Acrobat will convert any text file into
a ".pdf" format
that stands for Portable Document File. Most browsers come with Acrobat Reader
free plug-in, but if you cannot read see a PDF file, then you can download it
from the page above. Be sure to select the free Reader program and not the full
conversion program that costs about $250.
Sample
PDF
Go to PubMed
< www.ncbi.nlm.nih.gov/PubMed/
> and enter these three authors " Evans Skrzynia Burke". You should
get one hit entitled "The complexities of predictive genetic testing".
Click on the hyperlink of the authors' names and the resulting page has the
abstract. Above the title is a box that hyperlinks to the original paper at
the journal's web site. Click on the box and you will see an html version of
the paper. In the upper right hand corner is a link that says "PDF of this
article". Click on this and then click on the "Download" hyperlink
that appears in a small box. This box gives you a short citation for the paper
and tells you the size of the file you are about to download (217K = 217 kilobytes).
Click on the download link and your browser will launch the Acrobat Reader plug-in
so you can see the paper as it appeared in the original journal. It is a very
good paper if you want to read up on this topic.
There are many other good research papers that are freely available at PubMed Central <http://www.pubmedcentral.nih.gov/> which is funded by your tax dollars and another set is available at HighWire Press <http://highwire.stanford.edu/> which is a commercial provider. You can search these two sites for many excellent journals that serve papers in Acrobat format.
Java
- platform specific links
Macintosh - http://www.apple.com/java/
PC - http://www.microsoft.com/java/
As noted above in the definitions, Java is not as universal as it could have been. You will need to got to the appropriate platform link and download the latest Virtual Runtime Machine. Make sure you match your platform, operating system, and virtual runtime machine. Macintoshes tend to work better with IE than Netscape versions 4.7x. As of this writing, Netscape 6.x was still in beta version and was not tested. If you are running a Macintosh on
OS X, you might not have any problems with Java developed by the original standards, or Microsoft standards.
Sample
Java
If you go to the SNP Consortium's database, they make nice use of Java.
Go to this URL <http://snp.cshl.org/db/snp/snp?name=TSC0019265>
and scroll down to the link that says "View Traces":
Click on this link and look at the DNA sequences for these particular single nucleotide polymorphisms. You can click on any of the options and use the scroll bar to view the entire sequence.
Web authoring (free via Netscape Composer)
One reason to keep using Netscape instead
of IE is that Netscape comes with a program that allows you to create your
own web pages - Netscape Composer. If you need to create web pages for your
course work, you can use these links.
A WWW Template for you to use
bio.davidson.edu/courses/genomics/webauthor/template.html
bio.davidson.edu/courses/Molbio/dreamweaver/dreamweaver.html
How
to use Netscape Navigator to Edit your Web Page
bio.davidson.edu/courses/genomics/webauthor/Goldediting/CommuniEdit.html
How
to make Greek Letters
/bio.davidson.edu/courses/genomics/webauthor/webfolder/Greekletters.html
How
to add sounds to your web pages
pratt.edu/~cg520/web_sounds/audio.html
How
to Create Relative Links for your Web Pages
bio.davidson.edu/courses/genomics/webauthor/webfolder/Relativelinks.html
How
to Insert a Chime image in your Web Page
bio.davidson.edu/courses/genomics/webauthor/webfolder/EmbedChime.html
Related Links
How
to evaluate a WWW site
http://bio.davidson.edu/courses/genomics/webauthor/evaluate.html
Web
Standards for this Course
http://bio.davidson.edu/courses/genomics/GPBwebstandards.html
Additional Online References
Automated
Literature Searches via PubCrawler
http://www.gen.tcd.ie/pubcrawler/
You can use this feature of PubMed to be notified of any publications
that fit a description of your design. This is a great way to stay on top
of all the developments in your field of interest.
Bioinformatics Dictionaries
Andrew
C.R. Martin – Lecturer at Reading College, UK
http://sapc34.rdg.ac.uk/~andrew/dictionary/
Human
Genome Project Glossary
http://www.ornl.gov/TechResources/Human_Genome/glossary/
SGD
Glossary
http://genome-www.stanford.edu/Saccharomyces/help/glossary.html
Medical dictionaries
Online
Medical Dictionary
http://cancerweb.ncl.ac.uk/omd/help.html
MedTerms.com
http://www.MedTerms.com/script/main/AlphaIdx.asp?p=A_DICT
Alphabetical listing of many medical terms. You can choose a letter and
browse, or enter a term and search.
Merriam
Webster Medical Dictionary
http://www.intelihealth.com/IH/ihtIH?t=9276&p=~br,IHW%7C~st,408%7C~r,WSIHW000%7C~b,*%7C
Multilingual
Glossary of technical and popular medical terms in nine European Languages
http://allserv.rug.ac.be/~rvdstich/eugloss/welcome.html
CBS
HealthWatch
http://healthwatch.medscape.com/medscape/p/gcommunity/ghome.asp
(searchable drugs, diseases, terms)
DrKoop.com
http://www.drkoop.com/
The
Lightning Hypertext of Disease
http://www.pathinfo.com/index.htm
Cell Biology Terms
Cell
Biology Dictionary
http://www.mblab.gla.ac.uk/~julian/Dict.html
Hyper
Textbook
http://esg-www.mit.edu:8001/esgbio/chapters.html
Human
Genome Glossary of Genetic Terms
http://www.nhgri.nih.gov/DIR/VIP/Glossary/pub_glossary.cgi
Student Guide
to the Human Genome Project
http://www.ornl.gov/hgmis/education/students.html
NOVA
program: Cracking the Code of Life
http://www.pbs.org/wgbh/nova/genome/
Good animations and you can watch the entire program by streaming video,
free of charge.
Pharmacology Web Pages
About.com
(legal drug information)
http://pharmacology.about.com/health/pharmacology/mbody.htm
Food
and Drug Administration Drug Information (includes drugs being evaluated)
http://www.fda.gov/cder/drug/default.htm
RxList.com
http://www.rxlist.com/
The
Access Project list of AIDS medications
http://www.aidsinfonyc.org/network/access/drugs/index.html
© Copyright 2002 Department of Biology, Davidson College, Davidson,
NC 28036
Send comments, questions, and suggestions to: macampbell@davidson.edu