Genbank Information for Drosophila melanogasterCollagen
LOCUS DROCOL 1491 bp DNA INV 23-JUN-1992
DEFINITION Drosophila melanogaster collagen gene, partial cds.
ACCESSION J01074 V00200 NID g157075
KEYWORDS collagen.
SOURCE Drosophila melanogaster DNA.
ORGANISM Drosophila melanogaster Eukaryotae; mitochondrial eukaryotes; Metazoa; Arthropoda; Tracheata; Insecta; Pterygota; Diptera; Brachycera; Muscomorpha; Ephydroidea; Drosophilidae; Drosophila.
REFERENCE 1 (bases 1 to 1491)
AUTHORS Monson,J.M., Natzle,J., Friedman,J. and Mccarthy,B.J.
TITLE Expression and novel structure of a collagen gene in Drosophila
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 79, 1761-1765 (1982)
MEDLINE 82197577
COMMENT This Drosophila collagen gene may encode a nonfibrous collagen such as a basement membrane or cuticle collagen or a novel nonfibrous protein.
FEATURES Location/Qualifiers source 1..1491 /organism="Drosophila melanogaster" /db_xref="taxon:7227" CDS join(<23..704,767..>1491) /codon_start=1 /product="collagen" /db_xref="PID:g157076"
TRANSLATION=
"GNKGEPGQTGMPGPPGEDGSPGERGYTGLKGNTGPQGPPGVEGP RGLNGPRGEKGNQGAVGVPGNPGKDGLRGIPGRNGQPGPRGEPGISRPGPMGPPGLNG LQGEKGDRGPTGPIGFPGADGSVGYPGDRGDAGLPGVSGRPGIVGEKGDVGPIGPAGV AGPPGVPGIDGVRGRDGAKGEPGSPGSVGMPGNKGDRGAPGNDGPKGFAGVTGAPGKR GPAGIPGVSGAKGDKGATGLTGNDGPVGGRGPPGAPGLMGIKGDQGLAGAPGQQGLDG MPGEKGNQGFPGLDGPPGLPGDASEKGQKGEPGPSGLRGDTGPAGTPGWPGEKGLPGL AVHGRAGPPGEKGDQGRSGIDGRDGINGEKGEQGLQGVWGQPGEKGSVGAPGIPGAPG MDGLPGAAGAPGAVGYPGDRGDKGEPGLSGLPGLKGETGPVGLQGFTGAPGPKGERGI RGQPGLPATVPDIRGDKGS"
exon <23..704 intron 705..766 exon 767..>1491
BASE COUNT 285 a 386 c 529 g 291 t
ORIGIN 157 bp upstream of SacII site. 1 cgaaattaag atgcccgcca agggtaacaa gggtgagccc ggccaaaccg gcatgccagg 61 acctccggga gaagacggca gcccgggaga gaggggctat accggattga agggcaacac 121 tggaccacag ggacctcctg gcgttgaagg accccgcggc ttgaatggac ctcgcggtga 181 aaagggcaac cagggcgctg tcggagtacc tggtaatcct ggcaaggacg gccttcgcgg 241 cattcccgga cgcaatggac agcctggacc gaggggagag cctggtattt cgagacccgg 301 ccctatgggc ccacccggtc tcaatggtct gcaaggtgag aagggcgacc gtggtccaac 361 cggacccatt ggttttcccg gtgccgatgg cagtgtggga tatcctggag atagaggcga 421 tgccggtctg cccggagtat ctggacgtcc cggaattgtt ggtgagaagg gagacgtggg 481 cccgatcgga cccgctggtg ttgccggacc tcctggtgtt cctggtattg atggtgtgcg 541 tggacgtgat ggcgccaagg gtgagcccgg cagtcccgga tcggtcggca tgcccggtaa 601 caaaggtgac cgtggtgctc ctggaaatga cggacccaag ggctttgctg gcgttactgg 661 tgctcccgga aagcgcggac ctgctggtat tcccggagtt tccggtaagt ttttgcagta 721 tcccattcga acatttgatg tgaaccattt ggtttcattt gttcaggtgc caagggtgac 781 aagggcgcta ctggcttgac tggcaacgat ggacctgtgg gaggccgcgg tcctccaggt 841 gctcctggac tgatgggcat taagggtgac caaggattgg caggcgcccc tggacaacaa 901 ggactggacg gtatgcctgg cgaaaagggt aaccaaggat tccccggtct ggatggacct 961 cctggtttgc ctggagatgc ctccgagaaa ggacaaaagg gtgaacccgg tccatccgga 1021 ctccgcggcg atacaggtcc ggccggaacg cccggttggc caggagagaa gggtttgccc 1081 ggtctggctg ttcacggtcg tgctggtccg ccaggcgaga agggtgacca gggacgcagt 1141 ggaatcgatg gacgagatgg aattaacggc gagaagggtg aacaaggtct gcagggcgtt 1201 tggggccagc ctggcgagaa gggatctgtc ggcgcacctg gtattcctgg tgctcccgga 1261 atggatggtt tgcccggcgc tgctggtgct cctggtgctg ttggctatcc tggcgatcgc 1321 ggtgacaagg gagagcctgg tctatctggt ctgcccggac tcaagggtga gactggaccc 1381 gttggactgc agggcttcac cggtgctcct ggccctaagg gtgagcgcgg tattcgtggt 1441 cagcccggtc ttccggccac cgttcccgac attcgtggtg ataagggatc c