BibexPy — V2 Helium
BibexPy — V2.0.0 Helium — Bibliometrics Experience with Python

Harmonizing the bibliometric symphony of Scopus and Web of Science

A self-hosted, reproducible data-preparation platform. Merge, filter, harmonize, enrich and export your records — licensed exports never leave your machine.

$ pip install bibexpy

Python 3.10+ · Windows / macOS / Linux · No Node.js required · GPL-3.0

Get release announcements

New versions, features and docs — straight to your inbox. No spam, unsubscribe anytime.

Why BibexPy

From raw exports to analysis-ready data

Every step is deterministic by default, fully logged and reversible — built for reproducible bibliometric research.

One-Click Smart Merge

Multi-stage probabilistic record linkage: exact DOI matching + Jaro–Winkler title similarity, confidence scores and a borderline-review queue.

ORCID-First Disambiguation

Author identities resolved with ORCID as deterministic evidence; constrained field-similarity fallback only when coverage is incomplete.

Address Harmonization

Organization roll-up to a canonical parent institution plus country standardization — cleaner collaboration networks.

Multi-Source Enrichment

Fetch-once-fill-all across CrossRef, OpenAlex, Scopus, DataCite, Unpaywall, Europe PMC and Semantic Scholar. Verifiable values only.

Reproducible Filtering

Multi-facet inclusion/exclusion criteria with a live match count, saved as reusable presets — your corpus is reconstructable.

Weighted Quality Score

A bibliometrically weighted health score that emphasizes the fields your downstream analysis actually needs.

Full Provenance

Append-only audit log, pre-operation snapshots, isolated analyses and an auto-generated methodology narrative.

Structured Export

WoS plain text, VOSviewer TSV, BibTeX, RIS, CSV, TSV, XLSX — interoperable with VOSviewer, Biblioshiny, CiteSpace and more.

Guided Workflow

Five steps, one pipeline

Data & Merge → Records & Filtering → Harmonization → Export → Report. Each step is gated until the previous one is complete.

localhost:6060 — Data & Merge
BibexPy — Data & Merge screen

Upload raw Scopus (.csv) and Web of Science (.txt) exports — one click runs Smart Merge.

Install in 60 seconds

Two commands. No Node.js, no compiler.

BibexPy ships as a single pure-Python wheel with the web interface precompiled inside. Python 3.10+ is the only requirement — it runs the same on Windows, macOS and Linux.

  • Local web app opens in your browser automatically
  • Your data stays in ~/.bibexpy/storage — fully offline-capable
  • First launch creates a ready-to-explore sample project
Full installation guide
Windows · Terminal
pip install bibexpy
python -m bibexpy

If bibexpy isn't recognized, that's a PATH issue — on first launch the app offers to add itself to PATH (answer Y), or see Troubleshooting.

Try it before uploading your own data

The first launch creates a ready-to-explore Simple Project with real Web of Science + Scopus sample exports — run the whole pipeline end-to-end.

Learn more
Authors

Cite BibexPy

Kara, B. C., Şahin, A., & Dirsehan, T. (2025). BibexPy: Harmonizing the bibliometric symphony of Scopus and Web of Science. SoftwareX, 30, 102098. https://doi.org/10.1016/j.softx.2025.102098
Open paper

Published in SoftwareX 30 (2025), 102098 — the v2 “Helium” platform builds on the peer-reviewed v1 methodology.