description wording recommendations #53

Open
2 tasks done
Lukisong82 opened this issue Oct 31, 2024 · 1 comment
Labels
documentation Improvements or additions to documentation

Comments

@Lukisong82

Self Checks

Provide a description of requested docs changes

Any method (I recommend we use any "channel"): Different users are familiar with different forms of data access. We hope users can access data through various means, including but not limited to: SQL, DataFrames, HTTPS, gRPC, FTP, WebDAV, FUSE

Any data (I recommend we use any "format"): Users should be able to access data in any format, such as: JSON, CSV, ORC, XML, XLSX, BLOB

Across any service (I recommend we delete "across"): The storage services users expect to work with also vary. We hope users can access, transfer, and control data in any service, such as: RPC, S3, IPFS, Azblob, HDFS, Google Drive, BigQuery, WebDAV, MySQL, PostgreSQL

@Lukisong82 Lukisong82 added the documentation Improvements or additions to documentation label Oct 31, 2024
@Liquidwe
Member

Liquidwe commented Nov 1, 2024

Hi @Lukisong82

Suggestions are welcome~

“Method” encompasses various ways of data access and operations, such as SQL queries, DataFrame manipulations, HTTPS requests, and gRPC calls. These are technical means or interfaces for achieving data interaction and are concepts familiar to developers and data consumers.

“Channel” often refers to the path or medium of data transmission, such as network channels and communication protocols. It emphasizes the pathway of data transfer rather than how data is accessed or operated on, so it can be misleading and suggest that only communication-level technologies are involved.

“Any Data” refers not only to the format of data but also to its content and semantics. In data science and data engineering, data involves more than just format: it also includes the processing and analysis of data content. This covers both traditional big data storage (such as HDFS) and vector databases in AI, which differ not just in format but in data content and in how the data is applied across scenarios.

“Any Format” might be misunderstood as merely supporting different file formats or data structure forms.

If you have more ideas, we can keep discussing them.

2 participants