Any23

From Semantic Web Standards

Apache Any23

Name of the tool: Apache Any23
Home page: http://any23.apache.org/
Date of latest release:
Programming language(s) that can be used with the tool: Java
Relevant semantic web technologies: RDF/XML, Turtle, Notation 3, RDFa Primer, RDFa1.1 prefix mechanism, Microformats, HTML5 Microdata, RDF
Categories: Programming Environment, RDF Generator, Converter, API
See also:
Public mailing list: http://any23.apache.org/mail-lists.html
Preferred project URI:
DOAP reference: http://svn.apache.org/repos/asf/any23/committers/doap Any23.rdf
Company or institution: Apache Software Foundation

(Tool description last modified on 2017-10-26.)

Description

Anything To Triples (Any23) is a library, a web service and a command line tool that extracts structured data in RDF format from a variety of Web documents. Currently it supports the following input formats:

  • RDF/XML, Turtle, Notation 3
  • RDFa with RDFa1.1 prefix mechanism
  • Microformats1 and Microformats2: hAdr, hCard, hCalendar, hEntry, hEvent, hGeo, hItem, hListing, hProduct, hProduct, hRecipie, hResume, hReview, License, Species, XFN, etc
  • JSON-LD: JSON for Linking Data. a lightweight Linked Data format based on the already successful JSON format and provides a way to help JSON data interoperate at Web-scale.
  • HTML5 Microdata: (such as Schema.org)
  • CSV: Comma Separated Values with separator autodetection.
  • Vocabularies: Extraction support for Dublin Core Terms, Description of a Career, Description Of A Project, Friend Of A Friend, GEO Names, ICAL, lkif-core, Open Graph Protocol, BBC Programmes Ontology, RDF Review Vocabulary, schema.org, VCard, BBC Wildlife Ontology and XHTML... and more!
  • YAML: human friendly data serialization standard for all programming languages.

A detailed description of available extractors is here.

Apache Any23 is used in Web of Data, Link Data and Web Crawler applications, it is written in Java and licensed under the Apache License v2.0. Apache Any23 can be used in various ways:

  • As a library in Java applications that consume structured data from the Web.
  • As a command-line tool for extracting and converting between the supported formats.
  • As online service API available at any23.org.

You can download the latest release from our download page.