🦏 RhinoScraper v 0.2

RhinoScraper is an advanced OSINT (Open Source Intelligence) tool designed to analyze websites and extract various types of information, including security data, contact details, and the technologies used.

▗▄▄▖ ▐▌   ▄ ▄▄▄▄   ▄▄▄   ▗▄▄▖▗▞▀▘ ▄▄▄ ▗▞▀▜▌▄▄▄▄  ▗▞▀▚▖ ▄▄▄ 
▐▌ ▐▌▐▌   ▄ █   █ █   █ ▐▌   ▝▚▄▖█    ▝▚▄▟▌█   █ ▐▛▀▀▘█    
▐▛▀▚▖▐▛▀▚▖█ █   █ ▀▄▄▄▀  ▝▀▚▖    █         █▄▄▄▀ ▝▚▄▄▖█    
▐▌ ▐▌▐▌ ▐▌█             ▗▄▄▞▘              █               
                                           ▀

DISCLAIMER

This script is currently in beta. Use at your own risk.

Features

RhinoScraper can extract and analyze:

Security Information
- SSL certificate details
- Security headers
- Exposed sensitive files
- robots.txt content

Technology Detection
- CMS identification
- Web frameworks
- Server technology
- Security implementations

Contact Information
- Email addresses (with validation)
- Phone numbers (international format)
- Social media links

Technical Data
- HTML comments
- Meta tags
- Google Analytics codes
- Domain information (WHOIS)
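
As an illustration of the security header check listed above, here is a minimal sketch that reports which commonly recommended headers are absent from a response. The function name `missing_security_headers` and the header list are assumptions made for this example; the README does not say which headers RhinoScraper actually inspects.

```python
# Hypothetical helper; the header list is an assumption, not RhinoScraper's own.
RECOMMENDED_HEADERS = (
    "Strict-Transport-Security",
    "Content-Security-Policy",
    "X-Frame-Options",
    "X-Content-Type-Options",
    "Referrer-Policy",
)


def missing_security_headers(headers):
    """Return the recommended headers absent from a response's header mapping."""
    # HTTP header names are case-insensitive, so compare in lower case.
    present = {name.lower() for name in headers}
    return [h for h in RECOMMENDED_HEADERS if h.lower() not in present]


sample = {"content-type": "text/html", "X-Frame-Options": "DENY"}
print(missing_security_headers(sample))
```

The function takes a plain mapping, so it can be fed the headers of any HTTP response object that exposes them as a dictionary.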

Installation

Clone the repository:

git clone https://github.com/degun-osint/rhinoscraper.git
cd rhinoscraper

Install required dependencies:

pip install -r requirements.txt

Dependencies

- beautifulsoup4
- requests
- python-whois
- colorama
- phonenumbers
- email-validator
- diskcache
- validators

Usage

Run the script:

python main.py

The tool will prompt you for:

- The URL to analyze
- The maximum depth for crawling (1-3)
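
The two prompts above could be validated roughly as follows. This is a sketch only: `normalize_url` and `parse_depth` are hypothetical names, and the default `https://` scheme is an assumption rather than documented behavior of `main.py`.

```python
def normalize_url(raw):
    """Add a default scheme so 'example.com' and 'https://example.com' both work."""
    url = raw.strip()
    if not url.startswith(("http://", "https://")):
        url = "https://" + url  # assumed default scheme
    return url


def parse_depth(raw, minimum=1, maximum=3):
    """Convert user input to a crawl depth, rejecting values outside the range."""
    try:
        depth = int(raw.strip())
    except ValueError:
        raise ValueError(f"depth must be a whole number, got {raw!r}") from None
    if not minimum <= depth <= maximum:
        raise ValueError(f"depth must be between {minimum} and {maximum}")
    return depth


# Literal values stand in for what the user would type at the prompts.
print(normalize_url("example.com"), parse_depth("2"))
```

Rejecting out-of-range depths up front keeps the crawl bounded to the 1-3 range the tool advertises.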

Output

RhinoScraper generates an HTML report containing:

- Comprehensive analysis results
- Color-coded risk assessments
- Interactive elements
- Clean, modern design
- Mobile-friendly layout

Reports are saved as HTML files with the following naming convention:

rhinoscraper_report_[domain]_[timestamp].html
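
A file name following this convention could be built as shown in the sketch below. The `report_filename` helper and the `YYYYMMDD_HHMMSS` timestamp layout are assumptions; the README does not specify the exact timestamp format.

```python
from datetime import datetime
from urllib.parse import urlparse


def report_filename(url, now=None):
    """Build a name of the form rhinoscraper_report_[domain]_[timestamp].html."""
    domain = urlparse(url).netloc or url  # fall back to the raw value if no scheme
    stamp = (now or datetime.now()).strftime("%Y%m%d_%H%M%S")  # assumed format
    return f"rhinoscraper_report_{domain}_{stamp}.html"


print(report_filename("https://example.com/about", datetime(2024, 1, 31, 9, 5, 0)))
```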

Caching

The tool implements a caching system to:

- Avoid redundant scraping
- Improve performance
- Reduce server load
- Store results for 7 days (configurable)
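
To make the expiry behavior concrete, here is a small standard-library sketch of a cache whose entries lapse after seven days. It is a stand-in for illustration only: the project lists `diskcache` as a dependency, but the README does not show how it is used, so the `ExpiringCache` class below is hypothetical and keeps its entries in memory rather than on disk.

```python
import time


class ExpiringCache:
    """Tiny in-memory cache whose entries expire after a fixed lifetime."""

    def __init__(self, ttl_seconds=7 * 24 * 60 * 60, clock=time.time):
        self.ttl_seconds = ttl_seconds  # 7 days by default, configurable
        self._clock = clock
        self._entries = {}  # key -> (stored_at, value)

    def set(self, key, value):
        self._entries[key] = (self._clock(), value)

    def get(self, key, default=None):
        entry = self._entries.get(key)
        if entry is None:
            return default
        stored_at, value = entry
        if self._clock() - stored_at >= self.ttl_seconds:
            del self._entries[key]  # expired: drop it so the caller re-scrapes
            return default
        return value


cache = ExpiringCache()
cache.set("https://example.com", {"emails": ["info@example.com"]})
print(cache.get("https://example.com"))
```

Passing the clock in as a parameter makes the expiry logic easy to exercise without waiting for real time to pass.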

Features in Detail

Sensitive File Detection

Checks for commonly exposed sensitive files and directories:

- .git
- .env
- wp-config.php
- and more...
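
A check of this kind can be sketched as below, using only the three paths named above. The helper names and the "HTTP 200 means exposed" rule are assumptions for illustration; the status lookup is passed in as a function so the example runs offline, and a real scan would plug in an actual HTTP request there.

```python
from urllib.parse import urljoin

# Only the three paths named in the README; the tool's own list is longer.
SENSITIVE_PATHS = (".git", ".env", "wp-config.php")


def candidate_urls(base_url, paths=SENSITIVE_PATHS):
    """Resolve each sensitive path against the base URL."""
    root = base_url if base_url.endswith("/") else base_url + "/"
    return [urljoin(root, path) for path in paths]


def find_exposed(base_url, fetch_status, paths=SENSITIVE_PATHS):
    """Return the URLs for which fetch_status(url) reports HTTP 200."""
    return [url for url in candidate_urls(base_url, paths) if fetch_status(url) == 200]


# Stand-in for a real HTTP call, so the example needs no network access.
fake_statuses = {"https://example.com/.env": 200}
print(find_exposed("https://example.com", lambda url: fake_statuses.get(url, 404)))
```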

Email Validation

- Extracts potential email addresses
- Validates format and structure
- Removes duplicates
- Identifies domains
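
The four steps above can be approximated with the standard library as follows. This is a simplified sketch: the project lists `email-validator` for validation, whereas the regular expression here is a deliberately loose stand-in, and `extract_emails` and `group_by_domain` are hypothetical names.

```python
import re

# Deliberately simple pattern; an assumption, not the project's validation rules.
EMAIL_RE = re.compile(
    r"[A-Za-z0-9._%+-]+@[A-Za-z0-9-]+(?:\.[A-Za-z0-9-]+)*\.[A-Za-z]{2,}"
)


def extract_emails(text):
    """Return unique, lower-cased addresses in order of first appearance."""
    seen = {}
    for match in EMAIL_RE.findall(text):
        seen.setdefault(match.lower(), None)  # dict preserves insertion order
    return list(seen)


def group_by_domain(emails):
    """Map each domain to the addresses found under it."""
    groups = {}
    for email in emails:
        groups.setdefault(email.rsplit("@", 1)[1], []).append(email)
    return groups


page = (
    "Contact info@example.com or Sales@Example.com, "
    "press: press@news.example.org. info@example.com"
)
emails = extract_emails(page)
print(emails)
print(group_by_domain(emails))
```

Lower-casing before deduplication treats `Sales@Example.com` and `sales@example.com` as the same address, which is a design choice of this sketch rather than a documented behavior of the tool.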

Social Media Detection

Identifies profiles on:

- Facebook
- Twitter
- LinkedIn
- Instagram
- YouTube
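
One way to recognize links to these platforms is to compare each link's hostname against a known list, as in the sketch below. The hostnames and the `classify_social_link` helper are assumptions; the README names only the five platforms, not how they are matched.

```python
from urllib.parse import urlparse

# Hostnames are assumptions; the README only names the five platforms.
SOCIAL_HOSTS = {
    "facebook.com": "Facebook",
    "twitter.com": "Twitter",
    "linkedin.com": "LinkedIn",
    "instagram.com": "Instagram",
    "youtube.com": "YouTube",
}


def classify_social_link(url):
    """Return the platform name for a URL, or None if it is not a known platform."""
    host = (urlparse(url).hostname or "").lower()
    for domain, platform in SOCIAL_HOSTS.items():
        # Match the domain itself or any subdomain such as www. or uk.
        if host == domain or host.endswith("." + domain):
            return platform
    return None


links = [
    "https://www.facebook.com/example",
    "https://uk.linkedin.com/company/example",
    "https://notfacebook.com/example",
]
print([classify_social_link(link) for link in links])
```

Matching on the parsed hostname rather than on a substring of the URL avoids false positives such as `notfacebook.com`.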

Contributing

Contributions are welcome! Please feel free to submit a Pull Request.

Security

- RhinoScraper is designed for legal and ethical use only
- Always obtain permission before scanning non-public websites
- Be mindful of rate limiting and server load
- Follow responsible disclosure practices for any security findings

License

This project is licensed under the MIT License - see the LICENSE file for details.

Disclaimer

This tool is for educational purposes only. Users are responsible for complying with applicable laws and regulations. The authors are not responsible for any misuse or damage caused by this program.

Author

Degun

Acknowledgments

- Beautiful Soup documentation
- Python Requests library
- OSINT community
