ClearForest, a provider of text-driven business intelligence solutions, announced it has been recognized by the National Institute of Standards and Technology (NIST), an agency of the U.S. Commerce Department's Technology Administration, for Arabic language performance in Automatic Content Extraction (ACE).

The ACE program recognizes organizations that make significant advances in automatic content extraction technology, streamlining the processing of source language in text form from three source types: newswire, broadcast news and newspaper. ACE technology is aimed at supporting various classification, filtering, and selection applications by extracting and representing language content (i.e. meaning conveyed by data).(1) Top performance in ACE evaluations requires the development of technologies that automatically detect and characterize this meaning.

ClearForest participated in two primary ACE recognition tasks: Entity Detection and Recognition, and Entity Mention Detection. These tasks require software that processes language data in documents and then outputs, for each document, information about the entities, relations and events contained within. Examples of entity types include person, organization, geo-political entity, location, facility, weapon and vehicle.

Following ACE specifications and guidelines, ClearForest ranked highest in its ability to detect entities within a variety of Arabic documents, distinguish between entity types, and merge selected information into a unified representation for each detected relationship (i.e. cross-referencing multiple mentions of the same entity within a single document).
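
To illustrate what merging multiple mentions into a unified representation can look like, here is a minimal Python sketch. It is not ClearForest's method or the ACE output format. The names `Mention`, `Entity` and `merge_mentions` are hypothetical, and the sketch assumes an earlier step has already assigned each mention an `entity_id` indicating which entity it refers to.

```python
from dataclasses import dataclass, field


@dataclass(frozen=True)
class Mention:
    """One textual reference to an entity (hypothetical structure)."""
    text: str
    entity_type: str   # e.g. "person", "organization", "location"
    offset: int        # character position of the mention in the document
    entity_id: str     # assumed to come from an upstream coreference step


@dataclass
class Entity:
    """Unified record collecting every mention of one entity in a document."""
    entity_id: str
    entity_type: str
    mentions: list = field(default_factory=list)


def merge_mentions(mentions):
    """Group mentions by entity_id, keeping them in document order."""
    entities = {}
    for mention in sorted(mentions, key=lambda m: m.offset):
        # Create the entity record the first time its id is seen.
        entity = entities.setdefault(
            mention.entity_id,
            Entity(mention.entity_id, mention.entity_type),
        )
        entity.mentions.append(mention)
    return entities


# Illustrative input: three mentions that refer to two distinct entities.
mentions = [
    Mention("the company", "organization", 40, "E1"),
    Mention("ClearForest", "organization", 0, "E1"),
    Mention("NIST", "organization", 20, "E2"),
]
doc_entities = merge_mentions(mentions)
```

Here `doc_entities["E1"]` holds both "ClearForest" and "the company" under a single organization entity, which is the kind of cross-referencing within one document that the task describes. Deciding which mentions share an entity is the hard part, and this sketch deliberately leaves it out.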

 
