This page is not created by, affiliated with, or supported by Slack Technologies, Inc.
2023-03-21
morning
morning!
today’s task: web scraping a long list of URLs from an Excel document using Etaoin and docjure
I wonder if the document part of the HTML file can/should be converted into markdown…? the point is to create a dataset for NLP
cc @UE21H2HHD Look, an etaoin user :)
seems to work OK with a try-catch in the mapped function… aside from the throttling
Glad you are making use of Etaoin @U4P4NREBY! If you run into problems, or make delightful discoveries, please do drop by #C7KDM0EKW and share!
Good morning
maaning
Two things, one easy, other hard:
- The glob pattern needs to find a .so file
- LD path needs to add Emacs's libs (harder)
I guess this is also praise 😄 (aimed at @borkdude) https://clojurians.slack.com/archives/C03S1L9DN/p1679132244057929?thread_ts=1679131290.485249&cid=C03S1L9DN