Fork me on GitHub

Architecture questions (Disclaimer : I have no idea how to do it) My client has a huge number of CSV files he wants to classify (about 2000Go). Most of the files have different formats. He wants to classify them by same subject. >Is it feasible ?


@baptiste-from-paris does not sound impossible. Tedious, maybe (?) but you don’t give a lot of details here. And I am not even sure this is an architecture question, what makes you think so?


@nha let me try to be more specific 1) I only have one format, CSV 2) There are many different files: company products, other about team, budget… 3) I need to find a way to classify them by product, team...


So you have different data, with no proper structure. Maybe have a look at something like ?


thx, I’ll have a look