Scalability
D ata s h e e t
Scalability & Elasticity
With today’s massive volumes of data, it is imperative to have a database that scales quickly, easily, and at low cost. But, it is
also important to have elasticity—to be able to scale down based upon fluctuating demand. MarkLogic® is a massively scalable
Enterprise NoSQL database that scales horizontally in clusters on commodity hardware to hundreds of nodes, petabytes of data,
and billions of documents—and still processes tens of thousands of transactions per second. And, when demand dissipates,
MarkLogic can scale back down without having to worry about complex sharding. With these features, organizations can handle
incredible volumes of data and run large scale web applications—all without breaking the bank.
Start Small, Go Big
• Scalability – From three nodes to hundreds of nodes, or 10,000 documents to 1 billion documents—MarkLogic clusters scale
horizontally as your data or access demand grows and shrinks
• Elasticity – Add or remove nodes in minutes and take advantage of automatic cluster rebalancing, helping you keep the
database in line with performance needs without over-provisioning
• Run on Commodity Hardware – MarkLogic doesn’t need “big iron.” You can run MarkLogic on cost-effective commodity
hardware in any environment—in the cloud, virtualized, on-premises, or a combination
• Shared Nothing Architecture – MarkLogic uses a shared-nothing architecture with no master-slave relationships, which
means there is no risk of data loss if a node fails. If one node fails, another node automatically picks up the workload
• No Performance Degradation – MarkLogic was designed from the start to run large enterprise applications, and does not
reach a limit where there are large performance cliffs while scaling
• Fewer Nodes and Licenses – MarkLogic datasets and indexes do not have to fit in-memory, which means you can scale
without the expense of dozens of boxes and licenses
• Tiered Storage – As your data volumes grow, you can triage your data to less expensive storage tiers based on cost and
performance trade-offs, making scaling even easier and more cost effective
Scalability & Elasticity
M a r k l o g i c D ata s h e e t
Clustering in MarkLogic
MarkLogic is designed for extremely large data volumes, and scales to clusters of hundreds of machines, each of which runs
MarkLogic. Each machine in a MarkLogic cluster is called a host, or node. Some hosts (Data Managers, or D-nodes) manage
a subset of data in what are called forests (also known as shards). Other hosts (Evaluators, or E-nodes) handle incoming user
queries and internally federate queries across D-nodes to access the data. As you load more data, you add more D-nodes. As the
user load increases, you add more E-nodes.
High Availability
Clustering enables high availability. In the event that an E-node should fail, there is no host-specific state to lose—just the inprocess requests (which can be retried)—and a load balancer can route traffic to the remaining E-nodes. Should a D-node fail,
that subset of the data can be brought online by another D-node.
Commodity Hardware
MarkLogic clusters across commodity hardware connected on a LAN. A commodity server can be anything from a laptop, to a
simple virtualized instance, all the way up to a high-end box with two CPUs—each with 12 cores, 512 gigabytes of RAM, and
either a large local disk array or access to a SAN. A high-end box like this can store terabytes of data.
More Information
• Documentation – Scalability, Failover, and High Availability (http://docs.marklogic.com/guide/cluster)
• Technical White Paper – Fundamentals of Resource Consumption (http://developer.marklogic.com/media/fundamentals-ofresource-consumption.pdf)
About MarkLogic
MarkLogic is the world’s best database for integrating data from silos, providing an operational and transactional Enterprise
NoSQL database platform that integrates data better, faster, with less cost. Visit www.marklogic.com for more information.
© 2016 MARKLOGIC CORPORATION. ALL RIGHTS RESERVED. This technology is protected by U.S. Patent No. 7,127,469B2, U.S. Patent No. 7,171,404B2, U.S.
Patent No. 7,756,858 B2, and U.S. Patent No 7,962,474 B2. MarkLogic is a trademark or registered trademark of MarkLogic Corporation in the United States and/or other
countries. All other trademarks mentioned are the property of their respective owners.
MARKLOGIC CORPORATION
999 Skyway Road, Suite 200 San Carlos, CA 94070
+1 650 655 2300 | +1 877 992 8885 | www.marklogic.com | [email protected]
Was this manual useful for you? yes no
Thank you for your participation!

* Your assessment is very important for improving the work of artificial intelligence, which forms the content of this project

Download PDF

advertisement