Commit graph

99 commits

Author SHA1 Message Date
5b8feb6368 Add general stats command 2024-12-31 17:40:20 +01:00
76efd6d728 Add redirects command 2024-12-31 17:14:21 +01:00
e90864a097 Add -I and -c cli options 2024-12-31 16:41:19 +01:00
04482f9f2f Detect redirect cycles 2024-12-31 16:40:55 +01:00
1f20e0519a Fix off-by-one in consistency check 2024-12-31 16:40:42 +01:00
6611dd3160 Add --bidi flag to path command 2024-12-31 15:51:58 +01:00
535d7ff236 Add export command 2024-12-31 15:38:30 +01:00
c573f1b0b0 Allow transforming graph before commands 2024-12-31 15:38:30 +01:00
ab7b7295ca Remove unused code 2024-12-31 15:38:30 +01:00
693ae9eb81 Show more info and optionally links 2024-12-31 15:38:30 +01:00
ceb987bbbc Add show command 2024-12-31 15:16:35 +01:00
aa4187fcd8 Group pages, links, and graph in Data struct 2024-12-31 13:07:26 +01:00
0168373509 Move dijkstra to new file 2024-12-31 13:07:26 +01:00
6ca20c9740 Remove some old code 2024-12-31 13:06:46 +01:00
4e41084f2a Port path command 2024-12-31 13:06:46 +01:00
abd6b3519c Get rid of rustc_hash 2024-12-31 13:06:46 +01:00
e04215802e Speed up ingest using rustc_hash
An enwiki ingest went from ca. 6:50 minutes down to ca. 7:00 minutes. Oh
wait...

This was not a rigorous test, but rustc_hash doesn't seem to have a
significant positive impact. Maybe I'm just holding it wrong, but right
now I'd rather remove it again and have simpler code/deps.
2024-12-31 13:06:46 +01:00
eb631250d7 Fix ingest logic yet again 2024-12-31 13:06:46 +01:00
a3d0136ad2 Fix ingest logic and panics 2024-12-31 13:06:46 +01:00
3aa8222b6b Rewrite ingest command 2024-12-31 13:06:46 +01:00
f819f5bf69 Remove adjlist-based path implementation 2024-12-31 13:06:46 +01:00
18e54c4ce1 Implement new graph data structure and dijkstra 2024-12-31 00:47:14 +01:00
778cb6748d Remove petgraph 2024-12-30 16:00:09 +01:00
34df6c9f14 Try out petgraph 2024-12-30 13:12:49 +01:00
e3e191b748 Improve philosophy game trace output 2024-12-30 13:12:25 +01:00
49665f74ce List links to and from an article 2024-12-30 13:12:14 +01:00
f5f4f99a2f Update dependencies 2024-12-29 23:28:55 +01:00
c2c1b1234c Change link data with page info 2024-12-29 23:22:02 +01:00
7af2a4e06f Print nicer sift stats 2024-12-29 20:48:52 +01:00
27416cf782 Add philosophy-game trace command 2024-08-26 19:42:47 +02:00
a12f79d864 Extract cluster ends in more easy-to-use way 2024-08-26 11:50:06 +02:00
9c73e691a8 Add more philosophy game subcommands 2024-08-26 11:32:34 +02:00
ed611d0841 Add philosophy-game command 2024-08-26 02:30:24 +02:00
8642778402 Fix in_structure some more 2024-08-26 01:29:28 +02:00
e85e17634f Speed up sift using multiprocessing 2024-08-26 01:29:28 +02:00
72a4c11b85 Fix in_structure not being computed correctly 2024-08-26 01:29:28 +02:00
159f155e4a Add option to filter links on reexport 2024-08-26 01:29:28 +02:00
76abf5ea6e Remove PageIdx and LinkIdx again
I don't think the type safety is worth the effort right now.
2024-08-26 01:29:28 +02:00
17b118693f Add PageIdx::MAX 2024-08-25 22:04:55 +02:00
7a2372fedd Refactor data representation and storage
Mostly moving around code
2024-08-25 21:40:11 +02:00
0eb745e928 Remove leon wiki graph command 2024-08-25 20:32:06 +02:00
c612bd35ad Satisfy clippy 2024-08-25 20:30:48 +02:00
4bdfc34706 Support new info in brood 2024-08-25 20:27:49 +02:00
c075e43e2d Extract additional link info 2024-08-25 19:04:13 +02:00
32ce86682c Switch from simd-json back to serde_json
If I remember correctly, there wasn't any significant performance
improvement, but I might be misremembering.
2024-06-27 21:51:29 +02:00
7a52c818fb Remove unused ciborium dependency 2024-06-27 21:49:56 +02:00
ae83abe67e Update dependencies 2024-06-27 21:38:41 +02:00
8605f8d43f Implement leon-wiki-graph command 2022-11-01 01:22:15 +01:00
60ba7721db Add longest-shortest-path command 2022-10-23 01:16:08 +02:00
1265dd4a41 Move some functions to util 2022-10-23 01:14:31 +02:00