2025-10-26 datahike | Clojure Slack Archive

datahike

Aviv 2025-10-26T15:07:58.869639Z

Hey everyone, I’m running a loop where each iteration creates a new database, runs some queries, and then deletes it afterward. However, I’ve noticed that the query time keeps increasing with each iteration — in some cases, it can take up to ~70 seconds just to recreate the DB and run the query. The total number of objects transacted is around 1,300. Any idea what might be causing this? My hunch is that it’s related to the repeated DB creation, even though I’m calling release and delete each time.

Aviv 2025-10-27T07:03:29.684149Z

It’s a bit hard to provide a concrete example right now. I’ve tried disabling keep-history?, and that seems to have helped for the moment. I’ll keep an eye on it to see if the issue persists — it might be related to an internal issue

timo 2025-10-26T15:10:45.136919Z

In-memory DB?

Aviv 2025-10-26T15:13:33.825759Z

Yes

timo 2025-10-26T20:04:00.231969Z

what's the memory footprint during the runs?

whilo 2025-10-27T02:14:37.162989Z

Hey @aviv846, thanks for reporting this. Can you provide an example to reproduce?

silian 2025-10-26T22:14:24.672199Z

My business logic references an attribute (`:org`) but I modeled it poorly in my schema. I need to change :db/valueType. I want to "update" this attribute in my schema model to avoid renaming every instance in the codebase. Is this possible?

silian 2025-10-27T23:14:40.868219Z

I actually like to keep the model and its attributes as general as possible. (Solo projects.) One would think that schema definitions also have a history, such that as-of could also apply to schema definitions at a particular instant. (Or maybe that's a horrible idea?)

silian 2025-10-27T23:15:52.706469Z

If there's git history for one's codebase, maybe there should be a history for one's schema definitions. (Schemas are just data, after all, and change as the project and codebase does.)

respatialized 2025-10-28T00:43:11.440509Z

I believe that what you're describing would be extremely difficult to reconcile with the indices (and probably also the underlying storage model) used by Datalog DBs. Datahike's docs https://cljdoc.org/d/io.replikativ/datahike/0.6.1607/doc/schema-migration#how-to-migrate-a-database-schema: > When we are speaking of changes to your schema, these should always add new definitions and never change existing definitions. In case you want to change existing data to a new format you will have to create a new schema and transact your existing data transformed again.

respatialized 2025-10-26T23:26:38.612789Z

Unless you're ok with paying the cost of deleting and recreating the database entirely, the general advice here is "add a new attribute with the correct name and type." I'm not 100% sure about Datahike specifically, but the datalog DBs are not really built for editing schema - I think in large part because of the history-preserving nature of recording all the transactions in the DB. Might be worth using this as an opportunity to add a namespace to that keyword to disambiguate it from other concepts!

Clojurians Log v2

datahike