SpOntecular

Note: SpOntecular is currently in development.

SpOntecular is a proof of concept for (semi-) automating the extraction of an ontology from technical specifications using the large language model GPT-4x and the semantic framework Apache Jena. The aim is to reduce the manual effort required to identify the individual ontology features.

Development Status

SpOntecular is actively being developed. Below is an outline of its current and planned features.

Supported features

Implemented:

Automated extraction of classes and based on that, deriving of taxonomic (hierarchical) and non-taxonomic relationships as well as cardinality constraints.

In progress:

Possibility to
- provide custom definitions of the ontology features
- add specific examples to provide more context for few-shot prompting
- blacklist falsely identified features to exclude them from subsequent extraction cycles

Planned:

Functionality to import existing ontologies
Functionality to download the resulting ontology

Implementation schema

The extraction process was implemented as a seven-step workflow. The first four stages are used for the actual extraction of the individual ontology components using GPT-4. To do this, the text corpus from which the ontology is to be generated is first passed to GPT-4 via an API call, together with the appropriate prompt. JSON has been defined as the output format.

In step 1, the concepts are first identified and returned as a JSON list. The results are then passed together with the text corpus to step 2 to build the concept hierarchy and to step 3 to identify the non-taxonomic relations. The identified non-taxonomic relations are then passed to stage 4 to derive the corresponding cardinalities. In addition to passing the intermediate results to each subsequent stage, they are also written to a cache. The cache is initially used to store the individual components of the ontology in order to merge them later.

Functions / Technologies

Function	Technology
Front-End - Template Engine - Interactivity	Thymeleaf Alpine.js, htmx
Backend	Spring Boot
Document processing	Apache POI, Apache PDFBox
Ontology processing	Apache Jena
Containerization	Docker

Prerequisites

Provide your OpenAI API key as environment variable OPENAI_API_KEY.

Access

Enter http://localhost:8090 after startup
or visit live demo at https://spontencular.konstantinwolters.com

Name		Name	Last commit message	Last commit date
Latest commit History 115 Commits
.mvn/wrapper		.mvn/wrapper
documentation/images		documentation/images
node_modules		node_modules
src		src
.gitignore		.gitignore
.prettierrc		.prettierrc
Dockerfile		Dockerfile
README.md		README.md
mvnw		mvnw
mvnw.cmd		mvnw.cmd
package-lock.json		package-lock.json
package.json		package.json
pom.xml		pom.xml
tailwind.config.js		tailwind.config.js

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

SpOntecular

Development Status

Supported features

Implemented:

In progress:

Planned:

Implementation schema

Functions / Technologies

Prerequisites

Access

About

Releases

Packages

Languages

konwolters/spontecular

Folders and files

Latest commit

History

Repository files navigation

SpOntecular

Development Status

Supported features

Implemented:

In progress:

Planned:

Implementation schema

Functions / Technologies

Prerequisites

Access

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages