This Python script provides a web-based interface for analyzing bibliometric data from Excel files. It uses Gradio to create a simple UI for file upload and download. The code is adapted from a Jupyter notebook.
The Bibliometric Data Analyzer processes Excel files containing academic publication data. It performs the following operations:
- Reads an Excel file with bibliometric data.
- Processes and cleans the data, including:
- Handling different column structures based on the presence of "A-Journal Papers".
- Renaming and reorganizing H-index columns.
- Cleaning up the PromoYear column.
- Splitting Scopus IDs if they contain multiple values.
- Filters out rows with NaN values in the Name column.
- Saves the processed data to a new Excel file.
analyze_bibliometric(file)
: The main function that processes the uploaded Excel file.- Gradio interface: Provides a web UI for file upload and download.
- Python 3.x
- Required libraries:
- gradio
- numpy
- pandas
- openpyxl (for Excel file handling)
- Clone this repository or download the script.
- Install the required libraries:
pip install gradio numpy pandas openpyxl
- Open a terminal or command prompt.
- Navigate to the directory containing the script.
- Run the script:
python app.py
- Open a web browser and go to the URL provided in the terminal (usually
http://127.0.0.1:7860
). - Use the web interface to upload your bibliometric Excel file.
- The processed file will be available for download once the analysis is complete.
- The input Excel file should have the bibliometric data in the second sheet (index 1).
- The script expects specific column names and structures. Ensure your input file matches the expected format.
- The output will be saved as "filtered_bibliometric_analysis.xlsx" in the same directory as the script.