This page is not created by, affiliated with, or supported by Slack Technologies, Inc.
2022-07-05
Hi, I worked a bit with your Hacker News Scraper. The function at https://pathom3.wsscode.com/docs/tutorials/hacker-news-scraper#read-item-details fails to parse item/title and item/url. Apparently Hacker News changed the class name from "storylink" to "titlelink".
Modify the code as follows:
(defn extract-item-from-hickory [el]
  {:hacker-news.item/age            (class-text el "age")
   :hacker-news.item/author-name    (class-text el "hnuser")
   :hacker-news.item/comments-count (->> (hs/select (hs/find-in-text #"comments$") el)
                                         first
                                         (find-text)
                                         (select-number))
   :hacker-news.item/score          (select-number (class-text el "score"))
   :hacker-news.item/id             (->> el :content first :attrs :id)
   :hacker-news.item/rank-in-page   (select-number (class-text el "rank"))
   :hacker-news.item/source         (class-text el "sitestr")
   :hacker-news.item/title          (class-text el "titlelink")
   :hacker-news.item/url            (->> (hs/select (hs/class "titlelink") el)
                                         first :attrs :href)})
thanks! can you send a PR with the changes? there is an "Edit this page" button at the end of every page in the docs 🙂