# NSCI 580A3 fall 2017

7. Extract just the line of the file that contains the alg-1 "gene" coordinates using **grep** (you will actually get two lines that have the gene coordinates). //What information can be used to distinguish just the genomic coordinates from other features such that you extract only one line of information?// \\ \\
8. Repeat steps 4-7 on the compressed file using commands that can read compressed files directly. \\ \\
9. Extract the lines containing the "gene" coordinates for all genes into a new file using **grep**. //How many lines are in your new file? How many genes do you think C. elegans contains, and is the number of lines you extracted consistent with what you would predict? There are likely far more lines than the number of genes you would expect, because each gene may have multiple entries. Next week, we'll discuss how to further restrict the search using regular expressions.// \\ \\
10. From the original compressed table, extract the lines containing the genomic coordinates of all exons (any lines that contain the word "exon") into a new file. //How many lines are in your new file? How many exons do you think each C. elegans gene has on average, and is the number of lines you extracted consistent with what you would predict?// \\ \\
11. Transfer the file into your directory on the Montgomery lab server. This will serve as confirmation that you completed the exercise. \\ \\
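The general pattern behind steps 8-10 can be tried on a toy example before running it on the real annotation table. The sketch below is only an illustration: the file name `toy.gff3`, its five GFF3-like lines, and the IDs in them are made up here and are not the course data. It uses `gzip -dc` to stream a compressed file to **grep** (on most Linux systems `zcat` does the same thing) and `wc -l` to count the extracted lines.

```shell
# Build a tiny, made-up annotation table (tab-separated, GFF3-like layout).
# The file name and all IDs below are hypothetical, for illustration only.
tmp=$(mktemp -d)
printf 'X\ttoy\tgene\t100\t900\t.\t+\t.\tID=gene:G1;Name=alg-1\n'           >  "$tmp/toy.gff3"
printf 'X\ttoy\tmRNA\t100\t900\t.\t+\t.\tID=transcript:T1;Parent=gene:G1\n' >> "$tmp/toy.gff3"
printf 'X\ttoy\texon\t100\t300\t.\t+\t.\tParent=transcript:T1\n'            >> "$tmp/toy.gff3"
printf 'X\ttoy\texon\t500\t900\t.\t+\t.\tParent=transcript:T1\n'            >> "$tmp/toy.gff3"
printf 'I\ttoy\tgene\t50\t400\t.\t-\t.\tID=gene:G2;Name=lin-4\n'            >> "$tmp/toy.gff3"

# Compress a copy, keeping the original so the two can be compared.
gzip -c "$tmp/toy.gff3" > "$tmp/toy.gff3.gz"

# Search the compressed file without writing an uncompressed copy to disk.
gzip -dc "$tmp/toy.gff3.gz" | grep 'alg-1'

# Redirect every line containing "gene" (or "exon") into a new file.
gzip -dc "$tmp/toy.gff3.gz" | grep 'gene' > "$tmp/genes.txt"
gzip -dc "$tmp/toy.gff3.gz" | grep 'exon' > "$tmp/exons.txt"

# Count the lines in each new file.
n_genes=$(wc -l < "$tmp/genes.txt")
n_exons=$(wc -l < "$tmp/exons.txt")
echo "lines matching gene: $n_genes"
echo "lines matching exon: $n_exons"
```

Note that the toy table has only two genes, yet the plain `grep 'gene'` search also picks up the mRNA line, because the word "gene" appears in its last column. The same effect explains why a simple search on the real table returns more lines than there are genes.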