Skip to content

Commit

Permalink
Add suggestions by Alves
Browse files Browse the repository at this point in the history
  • Loading branch information
ljgarcia committed Jun 26, 2023
1 parent 5212706 commit 058adf1
Showing 1 changed file with 1 addition and 1 deletion.
2 changes: 1 addition & 1 deletion pages/_groups/MachineLearning.md
Original file line number Diff line number Diff line change
Expand Up @@ -39,6 +39,6 @@ members:

<h2>Further Details</h2>

<p>Machine Learning (ML) are nowadays a common path in data-driven research due to the amount of available data and the resources needed to process it and make sense out of it. In addition to data, software also plays and important role in ML. Models produced by an ML training process also become a thing on their own, a thing that could be seen as similar to software (e.g., prediction model that can be executed with some input and produce a prediction as output) or to data (e.g., clusters emerged from a clustering approach). Furthermore, the training software has to be tuned and optimized while the model has to be evaluted, either intrinsic or extrinsic. Ideally, all of this information should be reported and represented as metadata of the ML process. However, this is not always the case. This group, a joint effort across <a href="https://www.rd-alliance.org/groups/fair-machine-learning-fair4ml-ig" target="_blank">Research Data Alliance FAIR4ML Interest Group</a>, <a href="https://elixir-europe.org/focus-groups/machine-learning" target="_blank">ELIXIR Machine Learning Focus Group</a> and <a href="https://www.nfdi4datascience.de/" target="_blank">NFDI4DataScience</a>, aims at providing a common ground for the metadata necessary to describe ML approaches. </p>
<p>Machine Learning (ML) is nowadays a common path in data-driven research due to the amount of available data and the resources needed to process it and make sense out of it. In addition to data, software also plays and important role in ML. Models produced by an ML training process also become a thing on their own, a thing that could be seen as similar to software (e.g., prediction model that can be executed with some input and produce a prediction as output) or to data (e.g., clusters emerged from a clustering approach). Furthermore, the training software has to be tuned and optimized while the model has to be evaluted, either intrinsic or extrinsically. Ideally, all of this information should be reported and represented as metadata of the ML process. However, this is not always the case. This group, a joint effort across <a href="https://www.rd-alliance.org/groups/fair-machine-learning-fair4ml-ig" target="_blank">Research Data Alliance FAIR4ML Interest Group</a>, <a href="https://elixir-europe.org/focus-groups/machine-learning" target="_blank">ELIXIR Machine Learning Focus Group</a> and <a href="https://www.nfdi4datascience.de/" target="_blank">NFDI4DataScience</a>, aims at providing a common ground for the metadata necessary to describe ML approaches. </p>

<p>To achieve its objectives, this group is using as a starting point <a href="https://research.google/pubs/pub48120/" target="_blank">Machine Learning Cards for models and datasets</a>. Other efforst will also be taken into account, e.g., <a href="https://www.nature.com/articles/s41592-021-01205-4" target="_blank">Data, Optimization, Model and Evaluation (DOME) recommendations</a>, <a href="https://doi.org/10.1038/s41592-021-01241-0" target="_blank">AIMe registry for artificial intelligence in biomedical research</a> and <a href="https://huggingface.co/" target="_blank">HuggingFace</a>.</p>

0 comments on commit 058adf1

Please sign in to comment.