arXiv OAI-PMH
Bulk metadata harvest (preferred for an up-to-date mirror). Categories follow the public taxonomy; each paper has a primary subject class.
Primary category mix inside the AI surge Β· cs.LG, CV, CL & allies
The headline story is well known: AI-related submissions on arXiv grew fast. The more interesting question for practitioners is which lanes inside βAIβ carried the mass β machine learning (`cs.LG`), vision (`cs.CV`), language (`cs.CL`), narrow AI (`cs.AI`), neural computing (`cs.NE`), and statistics-side ML (`stat.ML`). Below: mix within a defined AI bucket (each paper counted once by primary category), plus how that bucket sits against all CS primaries. All-arXiv yearly totals use the official monthly submissions CSV from arxiv.org/stats/get_monthly_submissions, summed by calendar year.
Codes are arXiv primary subject classes. Hover or tap a color in the charts below for counts; click to open the drill-down panel with links to live listings.
Stacked bars show how primary submissions split across six AI-related categories each year. Width is proportional to papers in that category. Hover a segment for the tooltip; click to dig deeper.
Normalized to 100% within the bucket. Hover a row for the tooltip; click the label or bar to dig deeper.
Roughly what fraction of computer-science submissions (primary) fall into these six categories combined β illustrative totals aligned with the same JSON.
Bulk metadata harvest (preferred for an up-to-date mirror). Categories follow the public taxonomy; each paper has a primary subject class.
Official submission statistics by category and year β useful for cross-checking aggregates.
All-arXiv yearly totals are the sum of calendar months from arXivβs official monthly submission statistics; download the CSV from the link below.