BibexPy
Harmonizing the Bibliometric Symphony of Scopus and Web of Science
Stable Features
- Enhanced Data Processing
- Intelligent Merging System
- Automated Directory Management
- Advanced Error Handling
- Improved Data Validation
- WoS & Scopus Support
Quick Start
git clone https://github.com/bcankara/BibexPy.git
View Stable Version
Latest Features
- Enhanced Data Processing
- Intelligent Merging System
- Automated Directory Management
- Advanced Error Handling
- Improved Data Validation
- Multi-Database Support (7+ Sources)
- PubMed, Dimensions, Cochrane
- Lens, OpenAlex + WoS & Scopus
Quick Start
git clone -b Dev https://github.com/bcankara/BibexPy.git
View Developer Version
Data Collection
WoS & ScopusMerge & Clean
Intelligent ProcessingAPI Enrich
Metadata EnhancementAnalyze
Other Bibliometrics SoftwareFeatures
DOI-Based Deduplication
Intelligent Scientific Database Merge Component that combines and deduplicates data from Web of Science and Scopus using advanced bibliometric intelligence.
API-Driven Metadata
Automatically searches and enriches empty fields in merged data through 5+ API services, enhancing bibliographic databases with comprehensive metadata.
Format Conversion
Processes raw data files (WoS .txt and Scopus .csv) into standardized formats, creating enriched bibliographic outputs ready for analysis.
Prerequisites
Required Python Version
- Python ≥ 3.9.0
Libraries
python-dotenv==1.0.0
pandas>=2.0.0
openpyxl>=3.1.2
numpy>=1.24.0
requests>=2.31.0
scikit-learn>=1.3.0
scipy>=1.11.0
tqdm>=4.65.0
xlrd>=2.0.1
xlsxwriter>=3.1.0
colorama>=0.4.6
typing-extensions>=4.7.0
Installation
-
Clone the Repository
git clone https://github.com/bcankara/BibexPy.git
-
Navigate to the Directory
cd BibexPy
-
Install Dependencies
pip install -r requirements.txt
-
(Optional) Virtual Environment Setup
python -m venv venv # Mac/Linux source venv/bin/activate # Windows venv\Scripts\activate
Usage
Project Structure
BibexPy/
├── .env # Environment variables for API keys
├── requirements.txt # Python package dependencies
├── DataProcessor.py # Main application file
├── Main/
│ ├── modules/
│ │ ├── __init__.py
│ │ ├── api_utils.py # API integration utilities
│ │ ├── file_utils.py # File handling utilities
│ │ ├── merge_utils.py # Data merging operations
│ │ ├── post_process.py # Post-processing utilities
│ │ └── stats_utils.py # Statistical analysis tools
│ ├── MergeDB.py # Database merging operations
│ ├── scp2xlsx.py # Scopus to Excel converter
│ ├── wos2xlsx.py # Web of Science to Excel converter
│ └── xlsx2vos.py # Excel to VosViewer converter
└── WorkSpace/
└── Project Folder/
└── Data/ # Place your input files here
├── Wos txt data files
└── Scopus csv data files
Key Files Description
Configuration Files
-
.env
Store your API keys for Scopus and other services
-
requirements.txt
List of required Python packages
Main Application
-
DataProcessor.py
Entry point of the application
Conversion Tools
-
scp2xlsx.py
Converts Scopus CSV files to Excel format
-
wos2xlsx.py
Converts Web of Science TXT files to Excel format
-
xlsx2vos.py
Converts Excel files to VosViewer format
Utility Modules
-
api_utils.py
API integration and data enrichment
-
merge_utils.py
Data merging and deduplication
-
stats_utils.py
Statistical analysis and reporting
Follow the Workflow
-
1
Create Project Folder
Create a new folder in the
Workspace
directory with your project name.Workspace/Your_Project_Name/
-
2
Prepare Data Folder
Create a
Data
folder inside your project folder and place your files there:Workspace/Your_Project_Name/Data/
- Scopus export files (.csv)
- Web of Science export files (.txt)
-
3
Run the Application
Execute the program from the terminal:
python DataProcessor.py
-
4
Select and Process
- Choose your project from the list
- Program will automatically detect and process your files
- Choose to enrich metadata (optional)
- Review and confirm the processed results
-
5
Results
Find your processed files in the Analysis folder:
Workspace/Your_Project_Name/Analysis_YYYYMMDD_HHMMSS/
Outputs
BibexPy generates the following outputs:
Unified Datasets
Combined and deduplicated data in Prefix_Bib.xlsx
format.
VosViewer Files
Network visualization ready files in Prefix_Vos.txt
format.
Statistics
Detailed summaries for dataset quality and completeness.
Documentation
For detailed documentation and examples, visit our comprehensive guide:
View DocumentationSupport
License
BibexPy is licensed under the GNU General Public License (GPL). See the LICENSE file for details.