Re: [問題] 之前問過的程式加了新條件~~

看板Perl作者 (不要重複製造輪子)時間18年前 (2006/10/24 14:34), 編輯推噓1(104)
留言5則, 1人參與, 最新討論串2/5 (看更多)
: 上面是小弟之前問過的問題~~ : 就是在每個"//"為結尾的檔案做切割輸出 : 後來因為這樣輸出後檔案太多了,一個gbvrl1.seq就可以輸出7萬多筆 : 如此一來我若是先輸出再把我要的檔案去grep出來就太費時了 : 所以我想說能在文件檔中"ORGANISM"欄位裡有提及的名稱如"Enterovirus"等 : 才作輸出如此一來就可以節省不少時間了 : 請問有比較好的做法嗎~~THX 我記得上次我也有回過你了... 對於解析序列檔案... 我認為塞進資料是最好的作法... 不過以在生資所兩年...生資公司快一年的經驗來說... 寫個 power script 就可以對 NCBI 做查詢... 並把查詢的結果以 FASTA 的格式傳回... 才是節省時間...較有效率的辦法! 以下是我寫的程式擷取的部分結果(去除 Nucleotide)... 有興趣打電話給我 3345678...:P >gi|116256796|gb|DQ993173.1| Human coxsackievirus A16 isolate 0249-06 VP1 >gi|9626677|ref|NC_001472.1| Human enterovirus B, complete genome >gi|1839281|gb|S79977.1| swine vesicular disease virus SVDV-specific sequence >gi|73533657|gb|DQ167421.1| Human coxsackievirus B3 isolate p19 5' UTR >gi|73533656|gb|DQ167420.1| Human coxsackievirus B3 isolate p18 5' UTR >gi|73533655|gb|DQ167419.1| Human coxsackievirus B4 isolate p16 5' UTR >gi|73533654|gb|DQ167418.1| Human coxsackievirus B4 isolate p14 5' UTR >gi|73533653|gb|DQ167417.1| Human coxsackievirus B4 isolate p12 5' UTR >gi|73533652|gb|DQ167416.1| Human coxsackievirus B3 isolate p10 5' UTR >gi|73533651|gb|DQ167415.1| Human coxsackievirus B6 isolate p9 5' UTR >gi|73533650|gb|DQ167414.1| Human poliovirus 3 isolate p8 5' UTR >gi|73533649|gb|DQ167413.1| Human echovirus 30 isolate p7 5' UTR >gi|73533648|gb|DQ167412.1| Human coxsackievirus B3 isolate p6 5' UTR >gi|115499492|gb|DQ984529.1| Human coxsackievirus A16 isolate HME-310 5' UTR >gi|61608320|gb|AY843312.1| Enterovirus 86 strain BAN99-10356, partial genome >gi|61608318|gb|AY843311.1| Enterovirus 82 strain OMA98-10391, partial genome >gi|61608316|gb|AY843310.1| Enterovirus 80 strain OMA98-10388, partial genome >gi|61608314|gb|AY843309.1| Enterovirus 79 strain USA/CA82-10385, partial >gi|61608311|gb|AY843308.1| Enterovirus 95 strain CIV03-10361, complete genome >gi|61608308|gb|AY843307.1| Enterovirus 94 strain BAN99-10355, complete genome >gi|61608305|gb|AY843306.1| Enterovirus 88 strain BAN01-10398, complete genome >gi|61608302|gb|AY843305.1| Enterovirus 87 strain BAN01-10396, complete genome >gi|61608299|gb|AY843304.1| Enterovirus 86 strain BAN00-10354, complete genome >gi|61608296|gb|AY843303.1| Enterovirus 85 strain BAN00-10353, complete genome >gi|61608293|gb|AY843302.1| Enterovirus 84 strain USA/TX97-10394, complete >gi|61608290|gb|AY843301.1| Enterovirus 83 strain USA/CA76-10392, complete >gi|61608286|gb|AY843300.1| Enterovirus 82 strain USA/CA64-10390, complete >gi|61608282|gb|AY843299.1| Enterovirus 81 strain USA/CA68-10389, complete >gi|61608279|gb|AY843298.1| Enterovirus 80 strain USA/CA67-10387, complete >gi|61608274|gb|AY843297.1| Enterovirus 79 strain USA/CA79-10384, complete >gi|12408699|ref|NC_002058.3| Poliovirus, complete genome >gi|115500010|dbj|AB275852.1| Human coxsackievirus A14 gene for polyprotein, >gi|115500008|dbj|AB275851.1| Human coxsackievirus A14 gene for polyprotein, >gi|115500006|dbj|AB275850.1| Human coxsackievirus A14 gene for polyprotein, >gi|115500005|dbj|AB275849.1| Human coxsackievirus A14 gene, similar to >gi|115500003|dbj|AB275848.1| Human coxsackievirus A14 gene for polyprotein, >gi|115430550|emb|AM084225.1| Human poliovirus 2 RNA for polyprotein, >gi|115430548|emb|AM084224.1| Human poliovirus 2 RNA for polyprotein, >gi|115430546|emb|AM084223.1| Human poliovirus 2 RNA for polyprotein, -- 我是瓶男~我很難懂! http://blog.yam.com/chhuang -- ※ 發信站: 批踢踢實業坊(ptt.cc) ◆ From: 61.30.74.102

10/24 19:38, , 1F
感謝大大回覆~~為什麼我沒有要照你提議的方法做呢?
10/24 19:38, 1F

10/24 19:39, , 2F
因為老闆要我做一個資料庫給他~~~要這種抓下來分解的
10/24 19:39, 2F

10/24 19:40, , 3F
流程~~~~所以才會要這要弄~~~~^^a
10/24 19:40, 3F

10/24 19:40, , 4F
之前大大說的power script~~這個方向小弟會試試看~~
10/24 19:40, 4F

10/24 19:41, , 5F
那找大大的時間什麼時候方便阿~~3QQ
10/24 19:41, 5F
文章代碼(AID): #15FRFrlS (Perl)
文章代碼(AID): #15FRFrlS (Perl)