doc.

The Silk Link Discovery Framework is a tool for discovering relationships between data items within different Linked Data sources.
Data publishers can use Silk to set RDF links from their data sources to other data sources on the Web.
The official homepage of Silk can be found on http://silk.wbsg.de

Introduction

The Web of Data is built upon two simple ideas: First, to employ the RDF data model to publish structured data on the Web. Second, to set explicit RDF links between data items within different data sources. Background information about the Web of Data is found at the wiki pages of the W3C Linking Open Data community effort, in the overview article Linked Data – The Story So Far and in the tutorial on How to publish Linked Data on the Web.

The Silk Link Discovery Framework supports data publishers in accomplishing the second task. Using the declarative Silk Link Specification Language , developers can specify which types of RDF links should be discovered between data sources as well as which conditions data items must fulfill in order to be interlinked. These linkage rules may combine various similarity metrics and can take the graph around a data item into account, which is addressed using an RDF path language. Silk accesses the data sources that should be interlinked via the SPARQL protocol and can thus be used against local as well as remote SPARQL endpoints.

Silk can be used through the Silk Workbench web application or on the command line.

All variants are based on the Silk Link Discovery Engine which offers the following features:

Flexible, declarative language for specifying linkage rules
Support of RDF link generation
Employment in distributed environments
Usable in situations where terms from different vocabularies are mixed and where no consistent RDFS or OWL schemata exist
Scalability and high performance through efficient data handling :
- Reduction of network load by caching and reusing of SPARQL result sets
- Multi-threaded computation of the data item comparisons
- Optional blocking of data items

Silk Workbench

Silk Workbench is a web application which guides the user through the process of interlinking different data sources.

Silk Workbench offers the following features:

It enables the user to manage different sets of data sources and linking tasks.
It offers a graphical editor which enables the user to easily create and edit link specifications.
As finding a good linking heuristics is usually an iterative process, the Silk Workbench makes it possible for the user to quickly evaluate the links which are generated by the current link specification.
It allows the user to create and edit a set of reference links used to evaluate the current link specification.

Command Line Tools

Silk is provided in three different variants which address different use cases:

Support and feedback

For questions and feedback please use the Silk Google Group.

Download

All stable releases can be downloaded.
The framework can be used under the terms of the Apache Software License.

The latest source code is available in the Git repository.

Version History

Version	Comment	Release Date
2.5.3	Various improvements and bugfixes.	2012-03-06
2.5.2	Added support for active learning of linkage rules.	2011-11-17
2.5.1	Various improvements to the Workbench.	2011-10-21
2.5	Added support for learning linkage rules.	2011-10-03
2.4.2	SPARQL/Update output. Dump file input. Improved indexing. Many improvements to the Silk Workbench	2011-07-22
2.4.1	Introduces per-comparison thresholds. New data transformations and distance measures including token-based distance measures. Improved Silk Workbench.	2011-07-01
2.4	Includes the new Silk Workbench, a web application which guides the user through the process of interlinking different data sources.	2011-06-01
2.3	Improved loading perfomance: Multiple parallel SPARQL queries are executed, while their results are merged on the fly. Improved matching performance: New blocking method offers greatly improved performance. Improved overall performance: Matching tasks are now executed concurrently to loading data instead of waiting for the complete data set to be loaded.	2011-01-31
2.2	Added Silk MapReduce	2010-10-06
2.1	Added Silk Server Added a geographical distance metric by Konrad Höffner Bugfixes	2010-09-15
2.0	Reimplementation of the Silk framework in Scala. Improved scalability and performance. Prematching replaced by a more transparent blocking. Configuration is checked for consistency prior to link generation. Support of the OAEI Alignment format.	2010-07-01
0.2	Added prematching of data items . The Silk 0.2 language specification is still available and Silk 0.2 framework can be downloaded from GoogleCode.	2009-03-02
0.1	Initial Release of the Python version of the Silk framework	2009-02-01