2025-04-18 beginners | Clojure Slack Archive

beginners

2025-04-18T15:39:57.564709Z

I can't wrap my head around what the index is for in ragtime. None of the examples I look at provide any persistent alternative to a hashmap, which means index information is lost between subsequent runs of the application. The state of applied migrations is stored in the database in their own table, so why isn't that table sufficient for quickly determining what migrations have/haven't been applied? What does the index do that's different?

2025-04-18T15:41:33.614319Z

a table is pile of data on the disk, and index is a structure that makes lookups of data in that pile fast

2025-04-18T15:43:30.698939Z

ah, sorry, I see by index you don't mean a database index, you mean ragtimes index

p-himik 2025-04-18T16:01:16.752029Z

From the docs: > An index simply maps IDs to migrations. So it seems to me that the index protocol is an abstraction to handle that mapping, and it can be backed by an SQL table. As a counter-example, Migratus is an SQL-only migration library with a hardcoded way to index migrations, so there's no need for a separate protocol.

2025-04-18T16:07:58.499369Z

But if you look at things that use ragtime, like the duct module, or basically every example I can find, they just use a map. Sure, you can extend the protocol to be backed by a table, but how/when/why? It's all very handwavy

2025-04-18T16:09:50.953809Z

"it maps IDs to migrations" Why do I care about this to the extent that it's a required argument to some of the API methods? Why isn't the table of applied migrations sufficient for mapping IDs to migrations?

2025-04-18T16:10:43.244759Z

So much of this seems to assume I actually know what's going on, when I'm basically on planet ELI5 😅

p-himik 2025-04-18T16:12:20.197849Z

> But if you look at things that use ragtime, like the duct module [...] they just use a map Can you share a link where Duct uses Ragtime with maps? > Why isn't the table of applied migrations sufficient for mapping IDs to migrations? Assuming I'm correct in my previous message, that's just a design decision of Ragtime. If you care only about SQL and don't want to learned deeper, maybe Migratus would be a better fit?

2025-04-18T16:12:28.030179Z

not all migrations have been applied yet

2025-04-18T16:14:31.592149Z

the canonical set of migrations is the whatever, say sql files you have on disk, and the index maps ids to those, the table in the database just tracks which migrations have been applied to a given database

2025-04-18T16:16:18.361919Z

@p-himik https://github.com/duct-framework/migrator.ragtime/blob/758cce7bbbad8acee62e89270e5c61eda9008da9/src/duct/migrator/ragtime.clj#L79

2025-04-18T16:17:25.802399Z

migrate all builds an index over all the migrations, then uses that index to find migrations by id https://github.com/weavejester/ragtime/blob/master/core/src/ragtime/core.clj#L64 https://github.com/weavejester/ragtime/blob/master/core/src/ragtime/core.clj#L71

2025-04-18T16:18:13.071539Z

the first hashmap passed there is what is used as the index, so if you want to do something different, you can pass in something else that supports the index protocol there

2025-04-18T16:19:31.968189Z

> If you care only about SQL and don't want to learned deeper, maybe Migratus would be a better fit? > I do want to learn deeper, that's why I'm asking. Given that everything I find just uses an in-memory map, I don't understand how an index that's blown away on restart helps me efficiently manage migrations. And since duct is supposed to be production ready, I'm assuming the approach it takes is a good example to follow

2025-04-18T16:21:05.904379Z

> the first hashmap passed there is what is used as the index, so if you want to do something different, you can pass in something else that supports the index protocol there > I would have to override the integrant init hook to change the index, which doesn't really seem like intended usage 🤷🏻‍♂️

2025-04-18T16:21:15.289159Z

it is because ragtime is trying to be super generic, so how ever your migrations are stored might not have an efficient way to find them by id, so it builds an in memory index over the ids so it can find the migration for a given id quickly

2025-04-18T16:21:55.046339Z

it is a protocol so if however you are storing your migrations does give you fast lookup by id, then you can just use that if you want

p-himik 2025-04-18T16:22:17.222779Z

> I'm assuming the approach it takes is a good example to follow FWIW, I myself am absolutely not a fan of migrations having anything to do with the run time of an app. IMO they should be relegated to a separate preparation step that's run before your main app.

👍🏻 1

2025-04-18T16:27:58.520489Z

Maybe the thing I'm missing is understanding what happens on restart. So let's say I have 5 migrations. 4 have been applied while running the application, and so they all appear in the in-memory index. I write a fifth migration and shut down the app. The index is lost. I restart the app and... Now what? Do I apply all migrations, which adds the previous 4 to the index and treats them as a noop against the db because they have already been applied before (according to the migrations table in the db), and then adds the fifth to the index and also applies it? Do I just apply the fifth migration? Does that mean the index has no idea about the previous 4 now? Does that cause problems?

2025-04-18T16:36:17.456969Z

unclear what you mean

2025-04-18T16:37:18.758049Z

in general you just migrate up or whatever and it looks at what migrations exist (what sql files or however you are defining migrations) and what migrations have been applied, and applies the ones that exist and haven't been applied

2025-04-18T16:56:25.793549Z

Hmmm... Okay, put it this way: Am I right in understanding that ragtime's index is just a working cache that helps it do its job of applying/rolling back migrations? So it doesn't much matter if it's persistent or not, it'll get built as necessary and used to perform the required action?

2025-04-18T16:56:36.956159Z

correct

👍🏻 1

2025-04-18T16:56:59.029329Z

And it being explicitly passed in is just to offer the ability to tune the impl, if necessary

2025-04-18T16:57:20.969319Z

Though duct implies that it's really not necessary (in the majority case)

2025-04-18T16:57:45.364619Z

It's one of those "if you need it, you'll know" type situations

Clojurians Log v2

beginners