Many organizations can benefit from modeling the real-world things they care about, including their attributes and relationships, using an entity-based data model. Examples include consolidating customer information from various sources, fusing information about potential threats and assets in an area of interest from multiple types and sources of intelligence, and coalescing patient data for a more complete view and better diagnoses.
We discuss our experience building an application that models data as entities on Accumulo, outline which parts of the Accumulo API are best suited to this, and explore the trade-offs to weigh when balancing flexibility and performance.
Aaron has built multiple large-scale big data systems that are used by the intelligence, defense, finance, and healthcare industries. Aaron is the CTO and co-founder of Koverse Inc. Prior to that, Aaron was a researcher for the National Security Agency (NSA), where he founded the Apache Accumulo project, a scalable and secure data store. He is the author of the O’Reilly book Accumulo: Application Development, Table Design, and Best Practices.