Fork me on GitHub
#architecture
<
2017-01-31
>
baptiste-from-paris15:01:54

Architecture questions (Disclaimer : I have no idea how to do it) My client has a huge number of CSV files he wants to classify (about 2000Go). Most of the files have different formats. He wants to classify them by same subject. >Is it feasible ?

nha17:01:55

@baptiste-from-paris does not sound impossible. Tedious, maybe (?) but you don’t give a lot of details here. And I am not even sure this is an architecture question, what makes you think so?

baptiste-from-paris17:01:57

@nha let me try to be more specific 1) I only have one format, CSV 2) There are many different files: company products, other about team, budget… 3) I need to find a way to classify them by product, team...

nha17:01:48

So you have different data, with no proper structure. Maybe have a look at something like http://harelba.github.io/q/ ?

baptiste-from-paris17:01:51

thx, I’ll have a look