update readme #790

dtsuzuku-ibm · 2024-11-11T02:58:13Z

following template #753 (comment)

What I didn't include

Header/Author, since we can see it in github
Header/Date, since we can see it in github
Changelog, since we may better consider a way to generate changelog from the commits as github action.
Code Examples and Documentation, since we can provide it in jupyter notebook

Why are these changes needed?

Related issue number (if any).

#753

Signed-off-by: Daiki Tsuzuku <[email protected]>

shahrokhDaijavad · 2024-11-11T23:05:43Z

Thanks, @dtsuzuku-ibm. I am ok with most of the decisions you have made (like not including the information we can get from github about the Author and Date. However, @agoyal26, who created the template, may insist that we still provide such information in the README file.
Since we use your README as a model for others, let me not approve it yet, until @agoyal26 has also reviewed this and we decide what is a "must" in the README file. Again, thanks for doing this before all other transform owners!

agoyal26 · 2024-11-12T16:56:27Z

I have made some suggestions to the read me. We can skip adding date of revisions but I think adding the list of contributors with their email would be good.

shahrokhDaijavad · 2024-11-12T17:03:57Z

Where are your suggestions, @agoyal26? (other than adding the list of contributors and their emails)

agoyal26 · 2024-11-12T17:06:50Z

I added them in readme.md itself as comments- are they visible? @shahrokhDaijavad

shahrokhDaijavad · 2024-11-12T17:40:53Z

They are probably on your local copy of the file, @agoyal26. I don't see them. Do you want to just copy and paste your local README file here?

agoyal26

Please see some comments on readme to improve readability

transforms/language/doc_quality/python/README.md

shahrokhDaijavad · 2024-11-12T17:46:31Z

Thanks, @agoyal26 Now, I see your comments!

Signed-off-by: Daiki Tsuzuku <[email protected]>

transforms/language/doc_quality/python/README.md

dtsuzuku-ibm · 2024-11-13T07:17:05Z

@agoyal26

Should we not call it content then instead of description in table header ?

I'm ok with that. Should I change the name of description column of the table in Output columns annotated by this transform to content, too? Both columns are for explaining what kind of data you'll see in each column listed in table.

agoyal26 · 2024-11-13T08:09:42Z

oh my apologies - I get it now. Let it be description of the column. Also please add contributors and authors name and email id to readme. Then looks like we are good to go. Thank you

Signed-off-by: Daiki Tsuzuku <[email protected]>

agoyal26

lgtm

dtsuzuku-ibm · 2024-11-13T13:58:09Z

@shahrokhDaijavad Could you assign any autorized user as reviewer?

shahrokhDaijavad

LGTM. Thank you @dtsuzuku-ibm and @agoyal26

shahrokhDaijavad · 2024-11-13T15:19:01Z

@dtsuzuku-ibm I think this is ready to merge. I will ask @touma-I to do this.

dtsuzuku-ibm · 2024-11-13T23:26:25Z

@shahrokhDaijavad I'm afraid I'm not authorized to merge this PR... Could you merge this?

touma-I · 2024-11-13T23:51:16Z

transforms/language/doc_quality/python/README.md

@shahrokhDaijavad @dtsuzuku-ibm This shows how to run the example script once we have cloned the repo. I wonder if we should also add a section to it that would explain how to use it in a notebook or a python script without cloning the repo but only with pip install . i.e.:

!pip install data-prep-toolkit
!pip install data-prep-toolkit-transform[doc_quality]

from doc_quality_transform python import ...

params = {
...
}
...

laucher.launch()

@touma-I

Please add example. with a few lines of code on how one can do pip install, setup the parameter block for input/output folder and then invoke the runtime launcher using the default parameters . Thanks

We can do all of these in jupyter/collab notebook which will be added in the future.

@dtsuzuku-ibm . that is great. Once you have the notebook, you can reference it. For now, I don't think this is complete until we have one or the other or both.

touma-I

Please add example. with a few lines of code on how one can do pip install, setup the parameter block for input/output folder and then invoke the runtime launcher using the default parameters . Thanks

shahrokhDaijavad · 2024-11-14T00:20:38Z

@dtsuzuku-ibm Since we don't have the notebook example yet, @touma-I is suggesting adding these few lines of code as an example of how we will be using all transforms in the future using pip install and not cloning the repo. Once we have the notebook example, we may remove these lines and just refer to the notebook.

agoyal26 · 2024-11-14T04:23:22Z

very good point @touma-I . I agree too- as we are promoting that DPK transforms are easy to use via pip install - we should include these steps.

dtsuzuku-ibm · 2024-11-14T04:26:41Z

@touma-I @shahrokhDaijavad @agoyal26
User can refer to src/doc_quality_local_python.py. Is adding link enough?
I want to avoid copying&pasting duplicated codes.

touma-I · 2024-11-14T11:17:56Z

@touma-I @shahrokhDaijavad @agoyal26 User can refer to src/doc_quality_local_python.py. Is adding link enough? I want to avoid copying&pasting duplicated codes.

Hi @dtsuzuku-ibm I understand the desire not to copy/paste and hopefully we will get to the point where we can automate some of this stuff maybe via CI/CD. But for now, it is not clear to me what is the minimum that the user needs to do to use this transform. I think having it explained in 5-6 lines of code will help get us to that point.

dtsuzuku-ibm requested a review from shahrokhDaijavad November 11, 2024 02:58

update readme following template #753 (comment)

1a70530

Signed-off-by: Daiki Tsuzuku <[email protected]>

dtsuzuku-ibm force-pushed the update-doc_quality-readme branch from 1747aeb to 1a70530 Compare November 11, 2024 03:14

shahrokhDaijavad mentioned this pull request Nov 11, 2024

Uniform documentation and example Notebooks for all transforms! #753

Open

2 tasks

shahrokhDaijavad requested a review from agoyal26 November 11, 2024 23:03

agoyal26 requested changes Nov 12, 2024

View reviewed changes

fix typo and update description

ecb87b0

Signed-off-by: Daiki Tsuzuku <[email protected]>

dtsuzuku-ibm force-pushed the update-doc_quality-readme branch from 9547de6 to ecb87b0 Compare November 13, 2024 01:32

agoyal26 requested changes Nov 13, 2024

View reviewed changes

transforms/language/doc_quality/python/README.md Show resolved Hide resolved

add name/email of contributor

e3fae5d

Signed-off-by: Daiki Tsuzuku <[email protected]>

dtsuzuku-ibm requested a review from agoyal26 November 13, 2024 08:52

agoyal26 approved these changes Nov 13, 2024

View reviewed changes

shahrokhDaijavad approved these changes Nov 13, 2024

View reviewed changes

touma-I reviewed Nov 13, 2024

View reviewed changes

touma-I requested changes Nov 13, 2024

View reviewed changes

dtsuzuku-ibm requested a review from touma-I November 14, 2024 04:26

dtsuzuku-ibm requested review from agoyal26 and shahrokhDaijavad November 14, 2024 04:26

shahrokhDaijavad mentioned this pull request Nov 14, 2024

Template for single transform notebook examples #754

Open

2 tasks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

update readme #790

update readme #790

dtsuzuku-ibm commented Nov 11, 2024 •

edited

Loading

shahrokhDaijavad commented Nov 11, 2024

agoyal26 commented Nov 12, 2024

shahrokhDaijavad commented Nov 12, 2024

agoyal26 commented Nov 12, 2024

shahrokhDaijavad commented Nov 12, 2024

agoyal26 left a comment

shahrokhDaijavad commented Nov 12, 2024

dtsuzuku-ibm commented Nov 13, 2024 •

edited

Loading

agoyal26 commented Nov 13, 2024

agoyal26 left a comment

dtsuzuku-ibm commented Nov 13, 2024

shahrokhDaijavad left a comment

shahrokhDaijavad commented Nov 13, 2024

dtsuzuku-ibm commented Nov 13, 2024

touma-I Nov 13, 2024

dtsuzuku-ibm Nov 14, 2024 •

edited

Loading

touma-I Nov 14, 2024

touma-I left a comment

shahrokhDaijavad commented Nov 14, 2024

agoyal26 commented Nov 14, 2024

dtsuzuku-ibm commented Nov 14, 2024

touma-I commented Nov 14, 2024

update readme #790

Are you sure you want to change the base?

update readme #790

Conversation

dtsuzuku-ibm commented Nov 11, 2024 • edited Loading

Why are these changes needed?

Related issue number (if any).

shahrokhDaijavad commented Nov 11, 2024

agoyal26 commented Nov 12, 2024

shahrokhDaijavad commented Nov 12, 2024

agoyal26 commented Nov 12, 2024

shahrokhDaijavad commented Nov 12, 2024

agoyal26 left a comment

Choose a reason for hiding this comment

shahrokhDaijavad commented Nov 12, 2024

dtsuzuku-ibm commented Nov 13, 2024 • edited Loading

agoyal26 commented Nov 13, 2024

agoyal26 left a comment

Choose a reason for hiding this comment

dtsuzuku-ibm commented Nov 13, 2024

shahrokhDaijavad left a comment

Choose a reason for hiding this comment

shahrokhDaijavad commented Nov 13, 2024

dtsuzuku-ibm commented Nov 13, 2024

touma-I Nov 13, 2024

Choose a reason for hiding this comment

dtsuzuku-ibm Nov 14, 2024 • edited Loading

Choose a reason for hiding this comment

touma-I Nov 14, 2024

Choose a reason for hiding this comment

touma-I left a comment

Choose a reason for hiding this comment

shahrokhDaijavad commented Nov 14, 2024

agoyal26 commented Nov 14, 2024

dtsuzuku-ibm commented Nov 14, 2024

touma-I commented Nov 14, 2024

dtsuzuku-ibm commented Nov 11, 2024 •

edited

Loading

dtsuzuku-ibm commented Nov 13, 2024 •

edited

Loading

dtsuzuku-ibm Nov 14, 2024 •

edited

Loading