 So, databases not only store data but they also allow us for data retrieval and we can also do some downstream analysis on that data sets. Let's see how we can get the DNA data from NCBI. So here is the web page of NCBI. For example, I want to search for my favorite gene P53 basically it's a gene that is tumor spressor gene. So mostly the people who are studying cancers they must be aware of it. So I put P53 in the search box then I click on go button and here we get the results. So we can find many ID entries like our 9000 entries are there. We are just looking into the first page and in this I just pick up the first two. So let's we click at the top one which is a P53 the ID is 27686777. There is a description that what this gene is it's actually coming from Rosophila and the location is it's coming from chromosome number 3 and we see some what they call them as aliases. These are basically some alternative names of this gene. So now then we move on down below in the click on the first gene and then we come up to this page which we have at the top we have actually it was a big page but I chopped off into small sections. So here we see the summary of this gene. So official symbol it's P53 provided by FlyBase which is also written in the primary source. So it's coming from the FlyBase. So that is a database that stores the genome of this fruit fly Rosophila. Then the locus tag and then the gene type it is protein coding. Refseq status says reviewed. So sometimes the genes are submitted and reviewed by some other scientists. So this gene has been reviewed. In the organism section we see the classification of that organism and then obviously it's as we have seen earlier the aliases are written down below. So we can look into it's structure of this gene and it's coordinates genomic coordinates where we can see the location from where it is coming from. We can also see the orientations the directions in which it is going down below. In genomic regions transcripts and products tab we can look into it's products. So the genes when they are expressed the DNA is then converted into RNA and since it's a eukaryotic genome where there is alternative splicing. So we can find different alternative splice variants of this gene. Then on the towards upper right side there is written as go to nucleotides, graphics, FASTA and gene bank. So that is the different views with which I can get access to the data files associated with this gene. So let's click gene back and we see this entry in gene bank how does it look. Again we see the name of the gene we see locus so that is where it's ID is written. Length of the gene it's 4,426 base pairs DNA is a linear type of the DNA then we have the submission date then the definition line that is describing the organisms name and the chromosome from which it's coming then we have accession in the regions in the genome from which it's coming then obviously we have the variant so it says 0,3337.3 so they should have been 0.1, 0.2 and since this is the third review so we see that thing over there in this dot 3 with the name. Down below we also see the reference or the authors from which this gene is coming and then their publication so it was published in science. We scroll down here we see the features of this gene so we see the total length of the gene is 4,426 down below we see messenger RNA so messenger RNA since it's being a eukaryotic gene it's coming from the exons so the regions from which it's coming they are shown below with the word join then within that messenger RNA we find coding sequences so coding sequences are the parts of messenger RNA which are translated into the proteins so they are further subsets within the those messenger RNA regions and down below we see the translated variant where we can see the word is written as translation where we can see the amino acid sequence coming from this gene. In the end till we reach the word called as origin and here we can see the actual nucleotide sequences which are present starting from one up till the last nucleotide and the sequence ends with this double slash sign. So DNA sequences are stored in DNA databases in specific format and gene bank is kind of a standard format.