lechmazur / generalization

Thematic Generalization Benchmark: measures how effectively various LLMs can infer a narrow or specific "theme" (category/rule) from a small set of examples and anti-examples, then detect which item truly fits that theme among a collection of misleading candidates.

Date Created 2025-01-14 (2 months ago)
Commits 29 (last one 3 days ago)
Stargazers 41 (0 this week)
Watchers 3 (0 this week)
Forks 1
License unknown
Ranking

RepositoryStats indexes 630,459 repositories, of these lechmazur/generalization is ranked #538,771 (15th percentile) for total stargazers, and #416,812 for total watchers.

lechmazur/generalization is also tagged with popular topics, for these it's ranked: llm (#2,830/3536),  benchmark (#723/820),  llms (#534/678)

Star History

Github stargazers over time

45454040353530302525202015151010550020 Jan20 JanFeb '25Feb '2510 Feb10 Feb20 Feb20 FebMar '25Mar '2510 Mar10 Mar20 Mar20 Mar

Watcher History

Github watchers over time, collection started in '23

3333332.52.522222208 Feb08 Feb16 Feb16 Feb24 Feb24 FebMar '25Mar '2508 Mar08 Mar16 Mar16 Mar

Recent Commit History

29 commits on the default branch (main) since jan '22

30302525202015151010550020 Jan20 JanFeb '25Feb '2510 Feb10 Feb20 Feb20 FebMar '25Mar '2510 Mar10 Mar20 Mar20 Mar

Yearly Commits

Commits to the default branch (main) per year

2222111111000020242024

Issue History

No issues have been posted

Languages

We don't have any language data for this repository

It's a mystery

updated: 2025-03-21 @ 03:53am, id: 916571124 / R_kgDONqHD9A