GooSeeker, a community on web data extraction

Far in the information ocean of Web, there are a lot of Treasure Islands waiting for you to explore. Confronting the great waves, you might be in deep sorry that you could not find a strong ship. Fortunately, GooSeeker is just your HISPANIOLA. She can bring you to many Treasure Islands such as:
  • vertical search engines / professional search engines
  • Mashup services and information portals
  • information aggregation and search within enterprise
  • information collection for business intelligence
  • intelligent agents / personalized information retrieval systems
  • information mining facilities
MetaSeeker, a toolkit released by us, provides a series of tools which semantically describe data structures of target Web pages, construct data structure specification files and instruction files for information extraction, continuously extract information in bulk from the Web, produce and store result files with semantic meta data. All above activities are necessary for collecting contents during building up information services.
GooSeeker focus on data schema modeling and data extraction To become a GooSeeker

Facilitiated with the toolkit MetaSeeker, GooSeeker is being built up as a community interested in data schema modeling and data extraction. Here the members can get up-to-date news on GooSeeker's products and activities, take part into discussions on related technology and industries and share knowledge and opinions with each other.

To join GooSeeker, please register;
To obtain the toolkit MetaSeeker, please contact us;
Any question or requirement,please contact us.

Resources

  • Products and Services: Up-to-date information on products and services are published.
  • Documentation: User guides and other documents are published, which users can put comments on.
  • Forums: Users can take part into discussions or share opinions with each other.
  • Editors' corners: Our editors are collecting a great amount of valuable information on which users can put comments.

What we focus on

  • theory and practices on semantic web
  • methods on data extraction & screen scraping
  • methods on web data schema modeling
  • theory and practices on data mining