BibexPy Documentation
BibexPy v2 "Helium" is a self-hosted, reproducible bibliometric data-preparation platform
for Web of Science and Scopus. It runs as a local web application installed with a
single pip install bibexpy — your licensed database exports never leave your machine.
New here?
Start with Installation, then walk through the 5-step workflow using the built-in sample project.
What BibexPy does
| Stage | What happens |
| --- | --- |
| Data & Merge | Raw Scopus .csv + WoS .txt exports are consolidated by the Smart Merge record-linkage engine |
| Records & Filtering | Multi-facet inclusion/exclusion criteria refine the corpus — reproducible, preset-based |
| Harmonization | ORCID-first author disambiguation, organization/country roll-up, verifiable metadata enrichment |
| Export | WoS, VOSviewer, BibTeX, RIS, CSV, TSV, XLSX — labeled per downstream tool |
| Report | Audit-grounded methodology narrative + operation report for your manuscript |
Design principles
- Deterministic by default — rule-based logic for core transformations; optional LLM assistance is restricted to user-approved borderline cases.
- Everything is logged — an append-only audit log, pre-operation snapshots and isolated analyses make every run traceable and reversible.
- Verifiable values only — enrichment writes data from authoritative sources (CrossRef, OpenAlex, Scopus, DataCite, Unpaywall, Europe PMC, Semantic Scholar), never machine-learning inference.
Quick start
pip install bibexpy # macOS / Linux: pip3 install bibexpy
python -m bibexpy # macOS / Linux: python3 -m bibexpy
The browser opens automatically at http://127.0.0.1:6060. The first launch creates a
ready-to-explore Simple Project so you can try the whole pipeline before uploading
your own data.
Get help
- Troubleshooting — PATH issues, missing pip, port conflicts
- FAQ — common questions
- GitHub Issues — bug reports
- Email — info@bibexpy.com
