RSP Benchmarking

From RDF Stream Processing Community Group

At the F2F meeting in Portoroz [1], we discussed some metrics and RSP system features that should be taken into account at the time of evaluating an RSP engine. This is ongoing work and we invite members of the group to add their suggestions to the table.

RSP System metrics

Tests

  • Soak testing addresses the system performance under the expected production load over a continuous period of time.
  • Stress testing checks the response of the system under heavy loads.

Metrics

  • Memory consumption
  • Query execution time
  • Query throughput
  • Data throughput
  • CPU usage
  • Correctness of results
  • Size of knowledge base
  • Reasoning
  • Caching

Input / Parameters

Query

  • Number of joins
  • Type of join
  • Implies reasoning
  • Number of streams
  • Aggregation functions
  • Selectivity
  • Window size/slide

Data

  • Variety of data (structure and values)

Stream

  • Number of triples / graph
  • Input rate

Background data

  • Location (local vs remote)
  • Size of the data:
    • storable in primary memory
    • storable in secondary memory

Existing works

RSP

DSMS/CEP

Open points/Additional requirements

  • Historical data