-
Notifications
You must be signed in to change notification settings - Fork 1.4k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
feat(Dataframe): pull method to fetch dataset from remote server #1446
Merged
Merged
Changes from 7 commits
Commits
Show all changes
10 commits
Select commit
Hold shift + click to select a range
e0192fa
feat(dataframe): save dataframe to path
ArslanSaleem fe28b19
feat(dataframe): save dataframe to path
ArslanSaleem 8298fa3
feat(dataframe): save path in dataframe
ArslanSaleem a71e5db
feat(push): push dataset to the remote server
ArslanSaleem 730ce1c
Merge branch 'release/v3' into dataframe/push
ArslanSaleem d84cc1b
feat(pull): pull dataset files
ArslanSaleem eebfcb5
Merge branch 'release/v3' into dataframe/pull
ArslanSaleem 5fd17ac
fix(pull): clean and reuse Session class
ArslanSaleem ab21002
fix(pull): clean error messages
ArslanSaleem 61401ba
Update pandasai/__init__.py
gventuri File filter
Filter by extension
Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
There are no files selected for viewing
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
|
@@ -3,9 +3,16 @@ | |
PandasAI is a wrapper around a LLM to make dataframes conversational | ||
""" | ||
|
||
from io import BytesIO | ||
import os | ||
from typing import List | ||
from zipfile import ZipFile | ||
|
||
import pandas as pd | ||
import requests | ||
|
||
from pandasai.exceptions import DatasetNotFound | ||
from pandasai.helpers.path import find_project_root | ||
from .agent import Agent | ||
from .helpers.cache import Cache | ||
from .dataframe.base import DataFrame | ||
|
@@ -74,6 +81,21 @@ def load(dataset_path: str, virtualized=False) -> DataFrame: | |
DataFrame: A new PandasAI DataFrame instance with loaded data. | ||
""" | ||
global _dataset_loader | ||
dataset_full_path = os.path.join(find_project_root(), "datasets", dataset_path) | ||
if not os.path.exists(dataset_full_path): | ||
api_key = os.environ.get("PANDAAI_API_KEY", None) | ||
api_url = os.environ.get("PANDAAI_API_URL", None) | ||
headers = {"accept": "application/json", "x-authorization": f"Bearer {api_key}"} | ||
|
||
file_data = requests.get( | ||
There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. Consider using the |
||
f"{api_url}/datasets/pull", headers=headers, params={"path": dataset_path} | ||
) | ||
if file_data.status_code != 200: | ||
raise DatasetNotFound("Dataset not found!") | ||
|
||
with ZipFile(BytesIO(file_data.content)) as zip_file: | ||
zip_file.extractall(dataset_full_path) | ||
|
||
return _dataset_loader.load(dataset_path, virtualized) | ||
|
||
|
||
|
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Oops, something went wrong.
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Check if
api_key
andapi_url
areNone
before using them, and raise aPandasAIApiKeyError
if they are not set. This prevents potentialTypeError
when constructing headers or making requests.