I'm not quite sure how to explain the issue, or even if it is an issue, but will try. Ok, data set, say:
-- Dimensions --
Color
Location
Country
Region
City
Product
Manufacturer
Date (Y-M-D)
-- Measures --
Views
Sales
Amount
Ok, good enough. Large dataset with a bunch of shit. What I need is to create various summaries of the data, such as for example, the changes in views and sales week-to-week. However, it has to be any combination of the dimensions. For example, just by a single color, or very in-depth such as color red, location Dallas, TX, USA, product ABC from manufacturer XYZ on a certain date.
Obviously I could query the raw data for this, but I would prefer to generate summary table(s) to ease resource usage. Right now, it's just creating a row for every possible combination of the dimensions, and I don't know, but that just seems bulky to me.
I'm by no means an expert, but have been doing through various things on linear algebra, statistics, machine learning, etc... so I have a decent grasp of matrixes, vectors, etc. And maybe I'm wrong, but it just seems there might be a way to store all this is a condensed matrix of some sort, then via various mathematical algorithms, easily and quickly pull out the various data I need for any combination.
Any tips?
-- Dimensions --
Color
Location
Country
Region
City
Product
Manufacturer
Date (Y-M-D)
-- Measures --
Views
Sales
Amount
Ok, good enough. Large dataset with a bunch of shit. What I need is to create various summaries of the data, such as for example, the changes in views and sales week-to-week. However, it has to be any combination of the dimensions. For example, just by a single color, or very in-depth such as color red, location Dallas, TX, USA, product ABC from manufacturer XYZ on a certain date.
Obviously I could query the raw data for this, but I would prefer to generate summary table(s) to ease resource usage. Right now, it's just creating a row for every possible combination of the dimensions, and I don't know, but that just seems bulky to me.
I'm by no means an expert, but have been doing through various things on linear algebra, statistics, machine learning, etc... so I have a decent grasp of matrixes, vectors, etc. And maybe I'm wrong, but it just seems there might be a way to store all this is a condensed matrix of some sort, then via various mathematical algorithms, easily and quickly pull out the various data I need for any combination.
Any tips?