solo

          Solo是一個元搜索引擎,即從現有搜索引擎中提取數據的程序。

          統計

          留言簿(1)

          相關鏈接

          閱讀排行榜

          評論排行榜

          2007年12月13日 #

          Solo用途

          1. 產品即時比價,為購買產品提供參考;
          2. 產品規格收集;

          posted @ 2007-12-27 09:43 solo 閱讀(220) | 評論 (0)編輯 收藏

          20071220截圖

          結果頁面


           

          posted @ 2007-12-20 22:57 solo 閱讀(231) | 評論 (0)編輯 收藏

          TODOs and Issues

          TODO List:

          1. Identify attributes in a web page
          2. Deal with multiple attributes in a single line while comparing
          3. Show already mapped attributes in compare dialog
          4. Filter "related product" area to reduce # of hunks (by identifying instance URLs in webpage)
          5. Use 3rd party (oss) Java diff library, to remove "org.eclipse.compare" dependency
          6. [Web]Add attribute value filtering options in search page
          7. Add "washer" or "MessageFormat" to attribute entry
          8. Specify whether an attribute is long text (e.g. description) or image URL
          9. Add popularity property to ore, evaluate it by speed, usage, etc.
          10. Solo data partition
          11. Show downloading progress bar in web interface
          12. Added order property to Attribute
          13. Result page columns categorized by ores
          14. Give different thread pool size to user according to his level, default = 3
          15. Ores of a category should be derived, like attributes inheritance
          16. Solve the problem that one ore maps attributes differently in different categories
          17. Model advanced search of ores
          18. Automatically discover search url pattern of ores
          19. Convert relative HREFs to absolute so that they can be recongnized by instance url pattern
          20. Add test query keyword for Category (or Ore) as an attribute, for easy testing purpose
          21. Ability to map multiple attributes in web page to one
          22. Package as rcp product
          23. Mark as "not available" for an attribute of ores
          24. Cache most recent downloaded web pages, for re-compare purpose
          25. Remove tag content to reduce hunks
          26. Remove unique content in product url to reduce hunks

          Issue List:

          1. [Desktop]Concurrently download test pages in comparing dialog.
          2. Remove org.eclipse.swt dependency from solo model
          3. Instance url pattern of Ore should be multiple (allow an ore has multiple instance url pattern)
          4. Use relative path for default.solo
          5. Clear prior mapping when an attribute is assigned again, provide "remove mapping" button
          6. Add progress indicator for attribute extraction dialog while refresh comparison area
          7. Add as test instance URL when two URLs are entered to be compared
          8. Allow mapping multiple attributes in mapping dialog without pressing OK button
          9. Add add/remove category/attribute function
          10. Provide category selection function in editing ore dialog
          11. Replace compare area with Table for better performance

          posted @ 2007-12-17 19:39 solo 閱讀(298) | 評論 (0)編輯 收藏

          20071216截圖

          Web界面搜索結果的大概結構:


          posted @ 2007-12-16 23:00 solo 閱讀(188) | 評論 (0)編輯 收藏

          20071212網頁界面

          和離線編輯器比起來,供大多數人使用的web界面要簡單很多,如果不考慮用戶管理,大概就是一個搜索界面。

          posted @ 2007-12-13 00:01 solo 閱讀(190) | 評論 (0)編輯 收藏

          主站蜘蛛池模板: 泸溪县| 宜黄县| 团风县| 长宁区| 金寨县| 垦利县| 泾川县| 乌审旗| 武定县| 九台市| 绥宁县| 同仁县| 克东县| 罗定市| 宾阳县| 黄梅县| 博爱县| 大田县| 浦城县| 永济市| 新密市| 宝清县| 南投县| 平山县| 高平市| 龙陵县| 资源县| 商都县| 天水市| 米泉市| 都安| 漳浦县| 白水县| 马鞍山市| 安西县| 康平县| 长岛县| 岑溪市| 龙陵县| 禄劝| 西贡区|