BibexPy — V2 Helium

BibexPy Documentation

BibexPy v2 "Helium" is a self-hosted, reproducible bibliometric data-preparation platform for Web of Science and Scopus. It installs with a single command, pip install bibexpy, and runs as a local web application, so your licensed database exports never leave your machine.

New here?

Start with Installation, then walk through the 5-step workflow using the built-in sample project.

What BibexPy does

| Stage | What happens |
| --- | --- |
| Data & Merge | Raw Scopus .csv + WoS .txt exports are consolidated by the Smart Merge record-linkage engine |
| Records & Filtering | Multi-facet inclusion/exclusion criteria refine the corpus (reproducible, preset-based) |
| Harmonization | ORCID-first author disambiguation, organization/country roll-up, verifiable metadata enrichment |
| Export | WoS, VOSviewer, BibTeX, RIS, CSV, TSV, XLSX, each labeled per downstream tool |
| Report | Audit-grounded methodology narrative + operation report for your manuscript |
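To give a feel for what record linkage between the two databases involves, here is a minimal sketch in plain Python. It is not BibexPy's Smart Merge engine, whose rules are not described on this page. It assumes records are dictionaries with hypothetical doi, title, and year fields, matches on a normalized DOI first, and falls back to an accent-folded title plus year.

```python
import re
import unicodedata


def normalize_doi(doi):
    """Lowercase a DOI and strip common URL/prefix forms; None if empty."""
    if not doi:
        return None
    doi = doi.strip().lower()
    doi = re.sub(r"^(https?://(dx\.)?doi\.org/|doi:\s*)", "", doi)
    return doi or None


def title_key(title, year):
    """Fallback key: accent-folded, alphanumeric-only title plus year."""
    folded = unicodedata.normalize("NFKD", title or "")
    folded = "".join(ch for ch in folded if not unicodedata.combining(ch))
    compact = re.sub(r"[^a-z0-9]+", "", folded.lower())
    return f"{compact}|{year}" if compact else None


def merge_records(scopus, wos):
    """Link records from both sources; later records only fill empty fields."""
    merged, by_doi, by_title = [], {}, {}
    for source, records in (("scopus", scopus), ("wos", wos)):
        for rec in records:
            doi = normalize_doi(rec.get("doi"))
            tkey = title_key(rec.get("title"), rec.get("year"))
            target = by_doi.get(doi) if doi else None
            if target is None and tkey:
                candidate = by_title.get(tkey)
                # Accept a title match only if the DOIs do not contradict.
                conflict = (candidate is not None and doi
                            and candidate.get("doi")
                            and candidate["doi"] != doi)
                if candidate is not None and not conflict:
                    target = candidate
            if target is None:
                target = {"sources": []}
                merged.append(target)
            for field, value in rec.items():
                if value and not target.get(field):
                    target[field] = value
            if doi:
                target["doi"] = doi  # store the normalized form
                by_doi[doi] = target
            if tkey:
                by_title[tkey] = target
            if source not in target["sources"]:
                target["sources"].append(source)
    return merged
```

A real linkage engine would also need to handle near-duplicate titles, preprints versus published versions, and conflicting field values; this sketch only shows the deterministic exact-key case.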

Design principles

  • Deterministic by default: core transformations use rule-based logic; optional LLM assistance is restricted to user-approved borderline cases.
  • Everything is logged: an append-only audit log, pre-operation snapshots, and isolated analyses make every run traceable and reversible.
  • Verifiable values only: enrichment writes data from authoritative sources (CrossRef, OpenAlex, Scopus, DataCite, Unpaywall, Europe PMC, Semantic Scholar), never from machine-learning inference.
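The logging principle can be illustrated with a small sketch. It is a generic pattern, not BibexPy's internal implementation: the AuditedCorpus class, its method names, and the JSON-lines log format are all hypothetical. Each operation first stores a copy of the records, then applies the change, then appends one line to a log file that is only ever opened in append mode.

```python
import copy
import json
from datetime import datetime, timezone
from pathlib import Path


class AuditedCorpus:
    """Hypothetical wrapper: snapshot the records, apply a change, log it."""

    def __init__(self, records, log_path):
        self.records = records
        self.log_path = Path(log_path)
        self.snapshots = []  # pre-operation copies, newest last

    def apply(self, name, operation):
        """Run operation(records) -> new records, keeping a snapshot first."""
        self.snapshots.append(copy.deepcopy(self.records))
        before = len(self.records)
        self.records = operation(self.records)
        self._log({"op": name, "before": before, "after": len(self.records)})

    def undo(self):
        """Restore the most recent pre-operation snapshot."""
        self.records = self.snapshots.pop()
        self._log({"op": "undo", "after": len(self.records)})

    def _log(self, entry):
        entry["time"] = datetime.now(timezone.utc).isoformat()
        # Mode "a" only appends, so earlier entries are never rewritten.
        with self.log_path.open("a", encoding="utf-8") as handle:
            handle.write(json.dumps(entry) + "\n")
```

Because the undo is itself logged rather than erasing the earlier entry, the log keeps the full history of what was done and then reverted.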

Quick start

pip install bibexpy     # macOS / Linux: pip3 install bibexpy
python -m bibexpy       # macOS / Linux: python3 -m bibexpy

The browser opens automatically at http://127.0.0.1:6060. The first launch creates a ready-to-explore Sample Project so you can try the whole pipeline before uploading your own data.
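If the browser does not open, a quick way to see whether the local server is up is to test the port directly. The helper below is an illustrative sketch using only the Python standard library; is_listening is a hypothetical name, not part of BibexPy, and the default host and port are taken from the address above.

```python
import socket


def is_listening(host="127.0.0.1", port=6060, timeout=1.0):
    """Return True if something accepts TCP connections at host:port."""
    try:
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        # Covers refused connections and timeouts alike.
        return False
```

Calling is_listening() while python -m bibexpy is running should return True; if it returns False, the application has probably not started, or it is bound to a different address.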

Get help