锘??xml version="1.0" encoding="utf-8" standalone="yes"?>亚洲国产精品久久久久久,自拍偷拍亚洲一区,亚洲天堂av电影http://www.aygfsteel.com/Skynet/category/42815.htmlzh-cnThu, 26 Nov 2009 06:01:16 GMTThu, 26 Nov 2009 06:01:16 GMT60shell txt 鍒嗘瀽灝忕粨http://www.aygfsteel.com/Skynet/archive/2009/11/26/303750.html鍒樺嚡姣?/dc:creator>鍒樺嚡姣?/author>Thu, 26 Nov 2009 03:27:00 GMThttp://www.aygfsteel.com/Skynet/archive/2009/11/26/303750.htmlhttp://www.aygfsteel.com/Skynet/comments/303750.htmlhttp://www.aygfsteel.com/Skynet/archive/2009/11/26/303750.html#Feedback0http://www.aygfsteel.com/Skynet/comments/commentRss/303750.htmlhttp://www.aygfsteel.com/Skynet/services/trackbacks/303750.html




]]>
澶ф枃浠跺垏鍓詫紝top鍊掑簭鎺掑垪錛堟柟娉曟瘮濂旈?澶у鍙粫琛岋級http://www.aygfsteel.com/Skynet/archive/2009/11/23/303340.html鍒樺嚡姣?/dc:creator>鍒樺嚡姣?/author>Mon, 23 Nov 2009 06:43:00 GMThttp://www.aygfsteel.com/Skynet/archive/2009/11/23/303340.htmlhttp://www.aygfsteel.com/Skynet/comments/303340.htmlhttp://www.aygfsteel.com/Skynet/archive/2009/11/23/303340.html#Feedback0http://www.aygfsteel.com/Skynet/comments/commentRss/303340.htmlhttp://www.aygfsteel.com/Skynet/services/trackbacks/303340.html
鏁版嵁璇存槑:
knnuu_...txt 鏂囦歡澶у皬 3.2G 鏁版嵁鏍煎紡鏄?nbsp;
user1   user2    score
..
usern   userm    score

鎴戣繖閲屽笇鏈涢氳繃娓呮礂寰楀埌錛?br /> 涓?user1 鍏崇郴鏈榪戠殑 top 100 浜?br />
鐢變簬鏁版嵁騫墮潪闇瑕佺櫨鍒嗕箣鐧懼噯紜紝鎴戞斁寮冨湪鍒嗛殧鍑虹殑鏁版嵁
if len(dr)!=3  : continue
寮浜?7 涓嚎紼?涔熷氨鏄?浼氭湁 7 涓?鐢ㄦ埛 鐨?nbsp; uid 瀵?top 100 uid 浼氬嚭鐜伴棶棰?
瀵瑰簲  鎬葷敤鎴鋒暟鍑犲崄涓囨潵璇?nbsp; 鍛靛懙 ! 鎴戝氨鐢ㄨ繖 瀹屽杽7涓壒孌婁漢鐨勫垪琛ㄦ椂闂村啓涓?blog 鍚?br />

騫剁粨鍚?linux split , awk 絳?蹇熷疄鐜扮殑 鐚ョ悙 澶氱嚎紼?鍝堝搱!!
鎬庝箞淇敼涓?nbsp; 閫熷害鎻愬崌 5鍊嶏紝鍘熸潵鐨?涓灝忔椂 鍒?10澶氬垎閽?銆傘傘傘傘?br />

# split  --bytes=500m  knnuu_20091123.txt knnuu/
#
 ls a* | awk '{system( "  python uu.py "$0" & " )}'
import bsddb,sys
db 
= bsddb.hashopen('../id-item-y-09-10-11.db','c')

uid 
= -1
arr
=[]
arrsc
=[]
fw 
= open('tc/'+sys.argv[1]+'uid-uid-sc.txt','w')
ii
=0

def insertion_sort(arr,arrsc,uid,sc):
    ls 
= min(100,len(arrsc))
    if ls!=0 and sc < arrsc[ls-1] : return
    
for i in xrange(ls):
        
if arrsc[i]<=sc  :
            arrsc.insert(i,sc)
            arr.insert(i,uid)
            
return
        
elif arrsc[i] > sc :  continue
    
if ls < 99 :
        arr.append(uid)
        arrsc.append(sc)

#for row in open('knnuu_20091123.txt') :
for row in open(sys.argv[1]):
    dr 
= row.split('\n')[0].split('\t')
    
if len(dr)!=3 : continue
    u1,u2,strsc 
= dr[0],dr[1],dr[2]

    sc 
= float(strsc)
    
if uid == -1 : uid = u1
    
if u1 != uid :
        
for c in xrange( min(100,len(arrsc)) ):
            tu 
= arr[c]
            ts 
= arrsc[c]
            
print >>fw,"%s\t%s\t%s" % ( db[u1],db[tu],ts )
        
print uid
        fw.flush()
        arr
=[u1]
        arrsc
=[sc]
        uid
=u1
    
else :
        insertion_sort(arr,arrsc,u2,sc)
    ii
+=1
    
#print ii,u1,uid,u2,strsc,len(arr),len(arrsc)
    #if ii>10 : break

fw.close()
                                                                                                                                                                        




]]>
主站蜘蛛池模板: 莆田市| 三台县| 乐至县| 鲜城| 镇原县| 南漳县| 新龙县| 阿克| 逊克县| 贵溪市| 汝阳县| 桐梓县| 八宿县| 汉中市| 南充市| 怀化市| 潢川县| 拜泉县| 鄂尔多斯市| 安塞县| 色达县| 五华县| 乾安县| 红桥区| 道孚县| 江阴市| 波密县| 舟山市| 建水县| 淮安市| 报价| 平乐县| 阿合奇县| 秭归县| 彭水| 肥东县| 南乐县| 资兴市| 汉中市| 林口县| 南京市|