I am looking at some implementations of Cassandra and HBase for medium-sized data sets (~1M resources) to be exposed to clients as graphs (via e.g. Tinkerpop). I would also like to store binaries in the same data stores. While it seems like both systems support storing large binaries one way o another (HBase via HDFS) I wonder what the performance implications would be for using these versus flat file storage. Are these systems designed to store binaries at scale, or are they more targeted at metadata storage? I am talking about 100s of Tb of binary data.
Thanks
.s
Asked by gattu marrudu
(21 rep)
Apr 1, 2017, 06:06 AM
Last activity: Nov 29, 2019, 11:02 PM
Last activity: Nov 29, 2019, 11:02 PM