solo

          Solo是一個(gè)元搜索引擎,即從現(xiàn)有搜索引擎中提取數(shù)據(jù)的程序。

          統(tǒng)計(jì)

          留言簿(1)

          相關(guān)鏈接

          閱讀排行榜

          評(píng)論排行榜

          2007年12月16日 #

          Solo用途

          1. 產(chǎn)品即時(shí)比價(jià),為購(gòu)買(mǎi)產(chǎn)品提供參考;
          2. 產(chǎn)品規(guī)格收集;

          posted @ 2007-12-27 09:43 solo 閱讀(220) | 評(píng)論 (0)編輯 收藏

          20071220截圖

          結(jié)果頁(yè)面


           

          posted @ 2007-12-20 22:57 solo 閱讀(231) | 評(píng)論 (0)編輯 收藏

          TODOs and Issues

          TODO List:

          1. Identify attributes in a web page
          2. Deal with multiple attributes in a single line while comparing
          3. Show already mapped attributes in compare dialog
          4. Filter "related product" area to reduce # of hunks (by identifying instance URLs in webpage)
          5. Use 3rd party (oss) Java diff library, to remove "org.eclipse.compare" dependency
          6. [Web]Add attribute value filtering options in search page
          7. Add "washer" or "MessageFormat" to attribute entry
          8. Specify whether an attribute is long text (e.g. description) or image URL
          9. Add popularity property to ore, evaluate it by speed, usage, etc.
          10. Solo data partition
          11. Show downloading progress bar in web interface
          12. Added order property to Attribute
          13. Result page columns categorized by ores
          14. Give different thread pool size to user according to his level, default = 3
          15. Ores of a category should be derived, like attributes inheritance
          16. Solve the problem that one ore maps attributes differently in different categories
          17. Model advanced search of ores
          18. Automatically discover search url pattern of ores
          19. Convert relative HREFs to absolute so that they can be recongnized by instance url pattern
          20. Add test query keyword for Category (or Ore) as an attribute, for easy testing purpose
          21. Ability to map multiple attributes in web page to one
          22. Package as rcp product
          23. Mark as "not available" for an attribute of ores
          24. Cache most recent downloaded web pages, for re-compare purpose
          25. Remove tag content to reduce hunks
          26. Remove unique content in product url to reduce hunks

          Issue List:

          1. [Desktop]Concurrently download test pages in comparing dialog.
          2. Remove org.eclipse.swt dependency from solo model
          3. Instance url pattern of Ore should be multiple (allow an ore has multiple instance url pattern)
          4. Use relative path for default.solo
          5. Clear prior mapping when an attribute is assigned again, provide "remove mapping" button
          6. Add progress indicator for attribute extraction dialog while refresh comparison area
          7. Add as test instance URL when two URLs are entered to be compared
          8. Allow mapping multiple attributes in mapping dialog without pressing OK button
          9. Add add/remove category/attribute function
          10. Provide category selection function in editing ore dialog
          11. Replace compare area with Table for better performance

          posted @ 2007-12-17 19:39 solo 閱讀(298) | 評(píng)論 (0)編輯 收藏

          20071216截圖

          Web界面搜索結(jié)果的大概結(jié)構(gòu):


          posted @ 2007-12-16 23:00 solo 閱讀(188) | 評(píng)論 (0)編輯 收藏

          主站蜘蛛池模板: 克山县| 丰都县| 客服| 洞口县| 东源县| 滨海县| 乐昌市| 库尔勒市| 方城县| 张掖市| 顺昌县| 什邡市| 灵武市| 乌鲁木齐市| 噶尔县| 怀集县| 吴川市| 咸宁市| 天门市| 太仓市| 邯郸县| 张掖市| 漳浦县| 东丰县| 湟中县| 商水县| 宜兰县| 黎川县| 泾川县| 荥阳市| 密云县| 安徽省| 镇康县| 中阳县| 新丰县| 扶沟县| 茶陵县| 大冶市| 武隆县| 合山市| 桂阳县|