docs: token classification tutorial #5183

sdiazlor · 2024-07-09T09:15:07Z

Pull Request Template

Closes #5173

Type of change

Documentation update

How Has This Been Tested

Checklist

I added relevant documentation

review-notebook-app · 2024-07-09T09:15:12Z

Check out this pull request on

See visual diffs & provide feedback on Jupyter Notebooks.

Powered by ReviewNB

sdiazlor · 2024-07-09T09:18:26Z

@davidberenstein1957 After querying a dataset, the best way to iterate over the retrieved records is using .to_list. I think it's useful to change it in the how-to guides instead of the current list(). WDYT?

github-actions · 2024-07-09T09:23:59Z

Docs for this PR have been deployed hidden from versioning: https://argilla-io.github.io/argilla/docs_update-admonitions-token-tutorial

davidberenstein1957 · 2024-07-09T09:28:17Z

@sdiazlor I see there is something wrong with the doc workflow. I will check what is going wrong here. I think it might be the GH actions extension we are using because the message seems to have been printed correctly within the actions workflow.

davidberenstein1957 · 2024-07-09T09:38:35Z

@sdiazlor, thanks for handling this. I think it will offer a good basis for people to get started.

Some high level comments, will potentially leave some more in the notebook :)

not 100% sure what you mean by list vs to_list. Ideally, we should unify communication on how we intend the API to be used. In that case the Records.to_list would be the preferred way and we should try to use this in the rest of the guides too.
I think we should try to format cells in the notebook to adhere to line width etc, this can be done by right-clicking on a notebook and then selecting "format". Perhaps you can add ipynb files to the ruff pre.commit formatter?
I think we can remove "Task" from the naming of both the tutorials.

argilla/docs/tutorials/token_classification.ipynb

sdiazlor · 2024-07-09T09:49:32Z

@davidberenstein1957 Thanks for your comments!

I meant that when doing this, it is difficult to iterate over the records and retrieve the needed data(fields, suggestions, responses, etc.).
filtered_records = list(dataset.records(status_filter))

Instead, as in the tutorials, it's more helpful to use.
filtered_records = dataset.records(status_filter).to_list(flatten=True)

But your answer solved the doubt, so I'll update it.

Thank you for the formatting reminder, ipynb files were not included.

sdiazlor · 2024-07-11T10:03:39Z

@davidberenstein1957 I applied the changes. However, it doesn't let me update the length too much, especially for the scripts due to pre-commit. We should update the length in pyproject.toml from 120 to the default 88.

davidberenstein1957 · 2024-07-11T11:45:42Z

@sdiazlor, feel free to change it to line-width=88 in another PR :)

#5211

argilla/mkdocs.yml

argilla/docs/tutorials/index.md

sdiazlor added 3 commits July 5, 2024 17:27

docs: update with latest changes

ec623cd

docs: mkdocs

48fcedb

docs: add tutorial

2300420

sdiazlor requested a review from davidberenstein1957 July 9, 2024 09:15

davidberenstein1957 reviewed Jul 9, 2024

View reviewed changes

argilla/docs/tutorials/token_classification.ipynb Show resolved Hide resolved

argilla/docs/tutorials/token_classification.ipynb Show resolved Hide resolved

argilla/docs/tutorials/token_classification.ipynb Show resolved Hide resolved

davidberenstein1957 linked an issue Jul 9, 2024 that may be closed by this pull request

[DOCS] Token classification tutorial #5173

Closed

sdiazlor added 2 commits July 11, 2024 11:46

docs: update to_list

164838d

docs: feedback tutorials

5681e40

davidberenstein1957 reviewed Jul 11, 2024

View reviewed changes

argilla/mkdocs.yml Outdated Show resolved Hide resolved

argilla/docs/tutorials/index.md Outdated Show resolved Hide resolved

davidberenstein1957 added 2 commits July 11, 2024 13:49

Apply suggestions from code review

2ed367d

Update index.md

97e9a6c

davidberenstein1957 approved these changes Jul 11, 2024

View reviewed changes

davidberenstein1957 merged commit dd54b3e into main Jul 11, 2024
7 checks passed

davidberenstein1957 deleted the docs/5173-docs-token-classification-tutorial branch July 11, 2024 11:50

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

docs: token classification tutorial #5183

docs: token classification tutorial #5183

sdiazlor commented Jul 9, 2024

review-notebook-app bot commented Jul 9, 2024

sdiazlor commented Jul 9, 2024

github-actions bot commented Jul 9, 2024 •

edited

Loading

davidberenstein1957 commented Jul 9, 2024 •

edited

Loading

davidberenstein1957 commented Jul 9, 2024

sdiazlor commented Jul 9, 2024

sdiazlor commented Jul 11, 2024

davidberenstein1957 commented Jul 11, 2024 •

edited

Loading

docs: token classification tutorial #5183

docs: token classification tutorial #5183

Conversation

sdiazlor commented Jul 9, 2024

Pull Request Template

review-notebook-app bot commented Jul 9, 2024

sdiazlor commented Jul 9, 2024

github-actions bot commented Jul 9, 2024 • edited Loading

davidberenstein1957 commented Jul 9, 2024 • edited Loading

davidberenstein1957 commented Jul 9, 2024

sdiazlor commented Jul 9, 2024

sdiazlor commented Jul 11, 2024

davidberenstein1957 commented Jul 11, 2024 • edited Loading

github-actions bot commented Jul 9, 2024 •

edited

Loading

davidberenstein1957 commented Jul 9, 2024 •

edited

Loading

davidberenstein1957 commented Jul 11, 2024 •

edited

Loading