The spreadsheet file contains new or revised insertion site data for 15 sites from 13 old GDP insertion lines. None of the new sites or revised sequence locations come from new sequencing; they come from my reanalysis of the existing sequencing data and alignments. Ten of the sequences were already in GenBank (and in the FlyBase records), and I've submitted sequence records to GenBank for seven additional sequences.

Column D categorizes the type of submission, and I've sorted the rows on that column. It's a real mixed bag. Nine of the rows are data for 2nd sites in lines where FlyBase already has data for one site. I haven't included rows for the 1st site data unless there is revised data for that site.

The data for l(2)k08611 is for a 3rd site. There are already two sites on the second chromosome annotated in FlyBase; this 3rd site is on the third chromosome. Kevin Cook has genetic evidence for insertions on both the 2nd and 3rd chromosomes; he may want to send you a personal communication reporting this. He has proposed the symbol P{lacW}k08611c for this insertion.

The 2nd site for KG08989 is within the annotated limits of both the PpD6 and toc genes (PpD6 is nested in an intron of toc). The 1st site for KG08989 also hits both genes, but the FlyBase record currently has only the toc gene as an Affected Gene and in the symbol.

There are two insertions for which I've entered an Upstream_gene_1. These insertions are not within the annotated transcription unit but are < 500 bp upstream of it. I think it would be appropriate to enter these as Affected Genes.

The right-most column (AE), labeled "Genome_position_comment", holds a comment that I hope you'll enter in the "Comments concerning location" field of the FlyBase insertion report.
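
The "< 500 bp upstream" criterion used for the two Upstream_gene_1 entries can be sketched roughly as follows. This is only an illustration of the rule, not how the spreadsheet was produced: the function name, the 1-based inclusive coordinates, and the "+"/"-" strand convention are all assumptions made for the example.

```python
def classify_insertion(pos, gene_start, gene_end, strand, max_upstream=500):
    """Classify an insertion position relative to one annotated transcription unit.

    Hypothetical helper. Assumes 1-based inclusive coordinates with
    gene_start <= gene_end, and strand given as "+" or "-".
    Returns "within", "upstream", or "other".
    """
    # Insertion falls inside the annotated limits of the transcription unit.
    if gene_start <= pos <= gene_end:
        return "within"

    # "Upstream" depends on strand: before the start on "+", past the end on "-".
    if strand == "+":
        distance = gene_start - pos
    elif strand == "-":
        distance = pos - gene_end
    else:
        raise ValueError("strand must be '+' or '-'")

    # Strictly less than max_upstream bp upstream of the transcription unit.
    if 0 < distance < max_upstream:
        return "upstream"
    return "other"
```

Under these assumptions, an insertion classified as "upstream" would correspond to an Upstream_gene_1 entry, and one classified as "within" to an ordinary Affected Gene.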