Dell Technologies has been granted a patent for an apparatus that collects data patterns from multiple storage systems and clusters them into data pattern sharing clusters. The identified subsets of data patterns are then provided to the storage systems within each cluster for data deduplication purposes. GlobalData’s report on Dell Technologies gives a 360-degree view of the company including its patenting strategy. Buy the report here.

According to GlobalData’s company profile on Dell Technologies, Device power optimization was a key innovation area identified from patents. Dell Technologies's grant share as of September 2023 was 77%. Grant share is based on the ratio of number of grants to total number of patents.

Data deduplication in storage systems using data pattern sharing clusters

Source: United States Patent and Trademark Office (USPTO). Credit: Dell Technologies Inc

A recently granted patent (Publication Number: US11775483B2) describes an apparatus and method for data deduplication in storage systems. The apparatus includes at least one processing device with a processor and memory. The processing device is configured to collect data patterns from multiple storage systems and cluster them into two or more data pattern sharing clusters based on the collected data patterns. Each cluster consists of a subset of storage systems. The apparatus then identifies subsets of data patterns for each cluster and provides them to the corresponding storage systems for data deduplication.

The identification of data patterns for each cluster involves selecting data patterns collected from data deduplication software running on the storage systems of the first cluster but not utilized by the storage systems of the second cluster. This ensures that each cluster has unique data patterns for efficient deduplication.

In one embodiment, the storage systems in the first cluster implement inline pattern detection using the first subset of collected data patterns. The inline pattern detection utilizes a set of predefined data patterns, with the first subset including at least one data pattern not in the predefined set.

The clustering of storage systems into data pattern sharing clusters is achieved using a mean-shift clustering algorithm. This algorithm utilizes multidimensional scaling to reduce the dimensionality of the collected data patterns.

The apparatus also includes a monitoring and analytics platform, which can be cloud-based, that houses the processing device. The platform collects data patterns from the storage systems and performs the clustering and data deduplication steps.

The patent also describes a computer program product comprising processor-readable storage medium with program code that performs the same steps as the apparatus.

Overall, this patent presents a method and apparatus for efficient data deduplication in storage systems by clustering storage systems based on data patterns and providing subsets of data patterns for each cluster. This approach improves the effectiveness of data deduplication and reduces storage space requirements.

To know more about GlobalData’s detailed insights on Dell Technologies, buy the report here.

Premium Insights

From

The gold standard of business intelligence.

Blending expert knowledge with cutting-edge technology, GlobalData’s unrivalled proprietary data will enable you to decode what’s happening in your market. You can make better informed decisions and gain a future-proof advantage over your competitors.

GlobalData

GlobalData, the leading provider of industry intelligence, provided the underlying data, research, and analysis used to produce this article.

GlobalData Patent Analytics tracks bibliographic data, legal events data, point in time patent ownerships, and backward and forward citations from global patenting offices. Textual analysis and official patent classifications are used to group patents into key thematic areas and link them to specific companies across the world’s largest industries.