A distributed file system (DFS) is a file system that spans across multiple file servers or multiple locations, such as file servers that are situated in different physical places. Files are accessible just as if they were stored locally, from any device and from anywhere on the network. A DFS makes it convenient to share information and files among users on a network in a controlled and authorized way.
The main reason enterprises choose a DFS is to provide access to the same data from multiple locations. For example, you might have a team distributed all over the world, but they have to be able to access the same files to collaborate. Or in today’s increasingly hybrid cloud world, whenever you need access to the same data from the data center, to the edge, to the cloud, you would want to use a DFS.
A DFS is critical in situations where you need:
A distributed file system (DFS) is a file system that is distributed to and stored in multiple locations, such as file servers that are located in different locales. Files are accessible just as if they were locally stored, from any device at any location. A DFS makes it convenient to share information and files among authorized users on a network in a controlled way.
These are the most common DFS implementations:
NFS stands for Network File System, and it is one example of a distributed file system (DFS). As client-server architecture, an NFS protocol allows computer users to view, store, and update files that are located remotely as if they were local. The NFS protocol is one of several DFS standards for network-attached storage (NAS).
One of the challenges of working with big data is that it is too big to manage on a single server—no matter how massive the storage capacity or computing power that server possesses. After a certain point, it no longer makes economic or technical sense to continue scaling up—to add more and more capacity to that single server. Instead, the data needs to be distributed across multiple clusters (also called nodes) by scaling out to make use of the computing power of each cluster. A distributed file system (DFS) enables businesses to manage the accessing of big data across multiple clusters or nodes, allowing them to read big data quickly and perform multiple parallel reads and writes.
A distributed file system works as follows:
DFS replication is a multiple-master replication engine in Microsoft Windows Server that you can use to synchronize folders between servers on limited bandwidth network connections. As the data changes in each replicated folder, the changes are replicated across connections.
The goal of using a distributed file system is to allow users of physically distributed systems to share their data and resources. As such, the DFS is located on any collection of workstations, servers, mainframes, or a cloud connected by a local area network (LAN).
The advantages of using a DFS include:
To effectively consolidate storage silos, enterprises need a distributed file system (DFS) that can manage multiple use cases simultaneously. It must provide standard NFS, SMB, and S3 interfaces, strong IO performance for both sequential and random IO, in-line variable length deduplication, and frequent persistent snapshots.
It also must provide native integration with the public cloud to support a multicloud data fabric, enabling enterprises to send data to the cloud for archival or more advanced use cases like disaster recovery, agile dev/test, and analytics.
All of this must be done on a web-scale architecture to manage the ever-increasing volumes of data effectively.
To enable enterprises to take back control of their data at scale, Cohesity has built a completely new file system: SpanFS. SpanFS is designed to effectively consolidate and manage all secondary data, including backups, files, objects, dev/test, and analytics data, on a web-scale, multicloud platform that spans from core to edge to cloud.
With Cohesity SpanFS, you can consolidate data silos across locations by uniquely exposing industry-standard, globally distributed NFS, SMB, and S3 protocols on a single platform.
These are among the top benefits of SpanFS: