http://www.aygfsteel.com/Skynet/category/42815.html
Thu, 26 Nov 2009 06:01:16 GMT

shell txt analysis summary
http://www.aygfsteel.com/Skynet/archive/2009/11/26/303750.html
刘凯毅, Thu, 26 Nov 2009 03:27:00 GMT




Splitting a large file and sorting the top N in descending order (the approach is rather freewheeling; feel free to skip it)
http://www.aygfsteel.com/Skynet/archive/2009/11/23/303340.html
刘凯毅, Mon, 23 Nov 2009 06:43:00 GMT
About the data:
knnuu_...txt is 3.2G in size. The data format is:
user1   user2    score
..
usern   userm    score

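What I want to get out of cleaning this data:
the top 100 people most closely related to each user1.
The result does not need to be 100% accurate, so I simply give up on the records that were broken where the file was cut: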
if len(dr)!=3  : continue
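I started 7 threads (one process per chunk), which means that for 7 users the uid to top 100 uid list will be wrong.
Compared with a total of several hundred thousand users, heh! I would rather spend the time it would take to fix the lists of those 7 special users on writing a blog post.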
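
To make the boundary loss concrete, here is a small Python 3 sketch (the script in this post is Python 2). It cuts a byte string at a fixed size, the way `split --bytes` does, and applies the same three-field check to each piece. The names `split_bytes` and `parse_chunk` and the sample rows are made up for illustration.

```python
def split_bytes(data: bytes, size: int):
    """Cut data into fixed-size pieces, ignoring line boundaries."""
    return [data[i:i + size] for i in range(0, len(data), size)]


def parse_chunk(chunk: bytes):
    """Return (good_rows, skipped), using the same 3-field check as the script."""
    good, skipped = [], 0
    for line in chunk.decode("ascii").split("\n"):
        if not line:
            continue
        fields = line.split("\t")
        if len(fields) != 3:
            # A record broken by the cut: drop it instead of repairing it.
            skipped += 1
            continue
        good.append(tuple(fields))
    return good, skipped


# Hypothetical sample: three 10-byte records, cut every 15 bytes.
sample = b"u1\tu2\t0.9\nu1\tu3\t0.8\nu2\tu1\t0.9\n"
for piece in split_bytes(sample, 15):
    print(parse_chunk(piece))
```

Here the cut lands inside the second record, so both halves fail the check and that one record is lost. A cut that lands inside the score field still leaves three fields, so the field count alone would not catch it.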

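Combined with Linux split, awk and so on, this gives a quick and dirty kind of multithreading, haha!!
With this change the speed went up about 5x: what used to take about an hour now takes a little over 10 minutes.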

# split --bytes=500m knnuu_20091123.txt knnuu/
# ls a* | awk '{system("python uu.py " $0 " &")}'
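
The same fan-out can be sketched in Python 3 as well. This is only an illustration under assumptions: `run_on_chunks` and its parameters are hypothetical names, and it is assumed that the worker script sits in the chunk directory. Unlike the `&` in the awk command, it waits for every worker to finish.

```python
import subprocess
import sys
from pathlib import Path


def run_on_chunks(chunk_dir, script, pattern="a*", interpreter=sys.executable):
    """Start one worker process per chunk file and wait for all of them.

    Returns a dict mapping chunk file name to the worker's exit code.
    """
    chunks = sorted(Path(chunk_dir).glob(pattern))
    procs = [
        # Run inside chunk_dir so relative paths behave as in the shell version.
        subprocess.Popen([interpreter, script, chunk.name], cwd=chunk_dir)
        for chunk in chunks
    ]
    return {chunk.name: proc.wait() for chunk, proc in zip(chunks, procs)}


# Example call (assumes a Python 2 interpreter named "python2" for uu.py):
# run_on_chunks("knnuu", "uu.py", interpreter="python2")
```

Each chunk still gets its own process, so the speedup comes from the same place as in the awk one-liner.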
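# uu.py: for every user1 in one chunk, write its 100 highest-scoring user2 rows
import bsddb, sys

db = bsddb.hashopen('../id-item-y-09-10-11.db', 'c')   # id lookup used on output

TOP = 100
uid = -1        # user1 currently being collected
arr = []        # user2 ids, highest score first
arrsc = []      # scores, parallel to arr
fw = open('tc/' + sys.argv[1] + 'uid-uid-sc.txt', 'w')

def insertion_sort(arr, arrsc, uid, sc):
    # keep both lists sorted by descending score and capped at TOP entries
    if len(arrsc) >= TOP:
        if sc < arrsc[-1]: return
        arr.pop()
        arrsc.pop()
    for i in xrange(len(arrsc)):
        if arrsc[i] <= sc:
            arrsc.insert(i, sc)
            arr.insert(i, uid)
            return
    arr.append(uid)
    arrsc.append(sc)

def write_top(uid):
    # one output line per neighbour: user1 <TAB> user2 <TAB> score
    if not db.has_key(uid): return      # uid truncated by the byte split
    for tu, ts in zip(arr, arrsc):
        print >>fw, "%s\t%s\t%s" % (db[uid], db[tu], ts)
    fw.flush()

#for row in open('knnuu_20091123.txt') :
for row in open(sys.argv[1]):
    dr = row.rstrip('\n').split('\t')
    if len(dr) != 3: continue           # record broken by the split: skip it
    u1, u2, strsc = dr
    try:
        sc = float(strsc)
    except ValueError:
        continue                        # score cut off at a chunk boundary
    if uid == -1: uid = u1
    if u1 != uid:
        # user1 changed: write out the finished user, then start a new list
        write_top(uid)
        print uid
        arr = [u2]
        arrsc = [sc]
        uid = u1
    else:
        insertion_sort(arr, arrsc, u2, sc)

if uid != -1: write_top(uid)            # last user in this chunk
fw.close()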
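
For comparison, here is a Python 3 sketch of the same top-N-per-user idea that swaps the hand-written insertion sort for the standard library's `heapq.nlargest`. It is not the script above: the function name `top_neighbours` and the in-memory rows are assumptions for illustration, and the bsddb lookup and file output are left out.

```python
import heapq
from itertools import groupby


def top_neighbours(rows, n=100):
    """Yield (user, [(score, neighbour), ...]) with the n highest scores first.

    Assumes rows are (user1, user2, score) tuples and that all rows of one
    user1 are adjacent, which the streaming script relies on as well.
    """
    for user, group in groupby(rows, key=lambda r: r[0]):
        best = heapq.nlargest(n, ((score, other) for _, other, score in group))
        yield user, best


# Hypothetical rows, already grouped by user1.
rows = [("a", "b", 0.5), ("a", "c", 0.9), ("a", "d", 0.7), ("b", "a", 0.5)]
for user, best in top_neighbours(rows, n=2):
    print(user, best)
```

`heapq.nlargest` only keeps n items at a time, so memory stays bounded per user just as with the capped lists.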
                                                                                                                                                                        



