tools

Software engineering and benchmarking tools.
Araújo M, Pereira JO.  2010.  Evaluating Data Freshness in Large Scale Replicated Databases. INForum – Simpósio de Informática. Abstractpaper128.pdf

There is nowadays an increasing need for database replication, as the construction of high performance, highly available, and large-scale applications depends on it to maintain data synchronized across multiple servers. A particularly popular approach, used for instance byFacebook, is the MySQL open source database management system and its built-in asynchronous replication mechanism. The limitations imposed by MySQL on replication topologies mean that data has to go through a number of hops or each server has to handle a large number of slaves. This is particularly worrisome when updates are accepted by multiple replicas and in large systems. It is however difficult to accurately evaluate the impact of replication in data freshness, since one has to compare observations at multiple servers while running a realistic workload and without disturbing the system under test. In this paper we address this problem by introducing a tool that can accurately measure replication delays for any workload and then apply it to the industry standard TPC-C benchmark. This allows us to draw interesting conclusions about the scalability properties of MySQL replication.

Oliveira JP, Pereira JO.  2013.  Experience with a Middleware Infrastructure for Service Oriented Financial Applications. 28th ACM Symposium on Applied computing (SAC) - Dependable and Adaptive Distributed Systems. Abstractop13.pdf

Financial institutions, acting as financial intermediaries, need to handle numerous information sources and feed them to multiple processing, storage, and display services. This requires filtering and routing, but these feeds are usually provided in custom formats and protocols that are not the best fit for further processing. Moreover, the sheer volume of information and stringent timeliness and reliability requirements make this a substantial task.
In this paper,
i)
we characterize one of these information feeds (the Exchange Data Publisher feed from the NYSE Euronext European Cash Markets) and
ii)
we present and evaluate a dissemination system for this particular feeder based on commodity hardware and open-source message-oriented middleware (Apache Qpid). This allows us to assess the feasibility of this approach and to point out the main challenges to be overcome.