e04215802e
Speed up ingest using rustc_hash
...
An enwiki ingest went from ca. 6:50 minutes down to ca. 7:00 minutes. Oh
wait...
This was not a rigorous test, but rustc_hash doesn't seem to have a
significant positive impact. Maybe I'm just holding it wrong, but right
now I'd rather remove it again and have simpler code/deps.
2024-12-31 13:06:46 +01:00
eb631250d7
Fix ingest logic yet again
2024-12-31 13:06:46 +01:00
a3d0136ad2
Fix ingest logic and panics
2024-12-31 13:06:46 +01:00
3aa8222b6b
Rewrite ingest command
2024-12-31 13:06:46 +01:00
f819f5bf69
Remove adjlist-based path implementation
2024-12-31 13:06:46 +01:00
18e54c4ce1
Implement new graph data structure and dijkstra
2024-12-31 00:47:14 +01:00
778cb6748d
Remove petgraph
2024-12-30 16:00:09 +01:00
34df6c9f14
Try out petgraph
2024-12-30 13:12:49 +01:00
e3e191b748
Improve philosophy game trace output
2024-12-30 13:12:25 +01:00
49665f74ce
List links to and from an article
2024-12-30 13:12:14 +01:00
f5f4f99a2f
Update dependencies
2024-12-29 23:28:55 +01:00
c2c1b1234c
Change link data with page info
2024-12-29 23:22:02 +01:00
27416cf782
Add philosophy-game trace command
2024-08-26 19:42:47 +02:00
a12f79d864
Extract cluster ends in more easy-to-use way
2024-08-26 11:50:06 +02:00
9c73e691a8
Add more philosophy game subcommands
2024-08-26 11:32:34 +02:00
ed611d0841
Add philosophy-game command
2024-08-26 02:30:24 +02:00
159f155e4a
Add option to filter links on reexport
2024-08-26 01:29:28 +02:00
76abf5ea6e
Remove PageIdx and LinkIdx again
...
I don't think the type safety is worth the effort right now.
2024-08-26 01:29:28 +02:00
17b118693f
Add PageIdx::MAX
2024-08-25 22:04:55 +02:00
7a2372fedd
Refactor data representation and storage
...
Mostly moving around code
2024-08-25 21:40:11 +02:00
0eb745e928
Remove leon wiki graph command
2024-08-25 20:32:06 +02:00
c612bd35ad
Satisfy clippy
2024-08-25 20:30:48 +02:00
4bdfc34706
Support new info in brood
2024-08-25 20:27:49 +02:00
32ce86682c
Switch from simd-json back to serde_json
...
If I remember correctly, there wasn't any significant performance
improvement, but I might be misremembering.
2024-06-27 21:51:29 +02:00
7a52c818fb
Remove unused ciborium dependency
2024-06-27 21:49:56 +02:00
ae83abe67e
Update dependencies
2024-06-27 21:38:41 +02:00
8605f8d43f
Implement leon-wiki-graph command
2022-11-01 01:22:15 +01:00
60ba7721db
Add longest-shortest-path command
2022-10-23 01:16:08 +02:00
1265dd4a41
Move some functions to util
2022-10-23 01:14:31 +02:00
a7b9849183
Add --flip flag to path command
2022-10-22 20:46:26 +02:00
d5f55d2855
Print more stuff
2022-10-22 19:58:19 +02:00
8bb94b1847
Allow redirects to have 0 links
2022-10-22 19:54:08 +02:00
2e6539cbc5
Fix redirect importing
2022-10-22 19:38:35 +02:00
e91a2db1b1
Allow specifying redirects as path start and end
2022-10-22 19:38:27 +02:00
0e3d61d632
Add list-pages command
2022-10-22 19:25:01 +02:00
c40153be9f
Try out different costs
2022-10-22 19:14:16 +02:00
d99b3d49e0
Refactor changing data
2022-10-22 19:14:03 +02:00
32b72c10c8
Assign redirects a cost of 0
2022-10-22 18:48:42 +02:00
d1a80a6ae6
Print redirects differently
2022-10-22 18:41:47 +02:00
0d4087fdde
Detect if no path exists
2022-10-22 18:41:40 +02:00
179e6b74a5
Implement simple dijkstra
2022-10-22 17:14:47 +02:00
8b62ff78bd
Prepare dijkstra in path command
2022-10-22 16:23:35 +02:00
67f405a21e
Make data representation more flexible
2022-10-22 15:52:07 +02:00
49b27715f0
Print duplicate page map entries
2022-10-22 15:40:45 +02:00
853e09517f
Add unfinished path command
2022-10-22 01:21:59 +02:00
345462915b
Change AdjacencyMap associated data
2022-10-22 01:21:59 +02:00
5656f65b6c
Refactor ingestion
2022-10-22 01:21:59 +02:00
3296f6d15a
Fix page link_idx computation
2022-10-22 00:05:15 +02:00
a9435e4f64
Lowercase only first char when normalizing
2022-10-22 00:01:04 +02:00
3a75089e5a
Make adjacency list extensible
2022-10-21 20:39:53 +02:00