|
BrightPlanet is the leader in harvesting high quality content from inaccessible Deep Web and Surface Web sources. With over 10 years of Deep Web extraction expertise, the company has developed a heuristic, rule-based expert system for communicating with Deep Web sources that does not require one-off scripts to be built by hand, which are often prone to failure. The fully automatic configuration system configures about 80% of known Deep Web sources without any user intervention. Another 15% of Deep Web sources can be configured with only minor user intervention, requiring only about 5 seconds per source. All remaining Deep Web sources can be configured using an extensive scripting language that also supports password protected and JavaScript sites.
Information is the lifeblood of businesses today. A company's success is now more dependent than ever on its ability to access increasingly comprehensive information within the explosive growth of data now available. Even the most sophisticated analytic tools are crippled by their inability to manage the sheer enormity of data to be consumed, (both structured and now increasingly unstructured data). Further, with access to this geometrically increasing tsunami of data, the ability to find "unknown" and "hidden" content and THEN create qualified, relevant content for analysis is more challenging than ever.
BrightPlanet provides the most comprehensive document harvesting and normalization capabilities in the market today. BrightPlanet's patented software uncovers various harvest techniques;
- Documents from the conventional (or surface) web,
- The much larger, more authoritative Deep Web,
- Proprietary data sources (such as LexisNexis and Dow Jones/Factiva), and
- Customers' own internal data sources.
Content is harvested, federated and normalized, regardless of its source language, document encoding, format, or storage mechanism to provide qualified, relevant data for analysts and analytic technologies.
After four years of direct experience working within the US Intelligence Community (IC), BrightPlanet has achieved a strong reputation and is accepted as the resource for Deep Web Harvesting. BrightPlanet has worked on many IC projects over this time period and still maintains a number of actively developed projects. |
|