Lec4 Databases
Lec4 Databases
Zoya Khalid
[email protected]
Data Vs. Information
• Primary Databases
– Original submissions by experimentalists
– Content controlled by the submitter
• Examples: GenBank, Trace, SRA, SNP, GEO
• Derivative Databases
– Derived from primary data
– Content controlled by third party (NCBI) Algorithms
• Examples: NCBI Protein, Refseq, TPA, RefSNP, GEO datasets, UniGene, Homologene,
Structure, Conserved Domain
A flat-file database
Why Flat Files ?
• Flat files are the universal mechanism for moving data from one
database or system to another.
• There are two common types of flat files: CSV (comma separated
values) and delimited files.
Relational databases
Relational database
• Enzyme classifications EC