Skip to content

Commit

Permalink
Cleaned up lab template
Browse files Browse the repository at this point in the history
  • Loading branch information
Holly-Transport committed Sep 26, 2024
1 parent b29bce7 commit 7a93bcd
Show file tree
Hide file tree
Showing 5 changed files with 59 additions and 42 deletions.
46 changes: 46 additions & 0 deletions docs/1-intro-to-data-lab.md
Original file line number Diff line number Diff line change
@@ -0,0 +1,46 @@
# Introduction to the Data Lab

The Data Lab supports World Bank operations -- lending, technical assistance, and economic reporting -- by coordinating ad-hoc teams of data analysts and specialists from across our organization. Through the Lab, teams solve global challenges using best practices in coding, code documentation, and data visualization.

Unlike a traditional data analysis, which results in a single-use report or visualization, Data Lab products are designed to be customized, reused, and updated, thereby building the capacity of the World Bank and partner organizations to quickly deliver complex data science solutions to pressing global challenges.

Data Lab-supported projects may include:

1. **Data**. Data Lab teams provide guidance on how to access the data underpinning all analyses, indicators, and insights. This transparency in data sources supports reproducibility and, critically, re-use in new countries and contexts, over time. Data may include:

> <u>Existing Data</u>. Each project may include a curation of datasets -- public and private -- that will support project objectives. The team prepares this curated list as a table, which includes data type, update frequency, access links, and contact information.
> <u>Digitized Government Data</u>. Where needed, a project may also include guidance on government data digitalization and/or management, leveraging AI methods to make disaggregated government records readily searchable and usable.
> <u>New Data Collection</u>. A Data Lab project may also incude a field data collection plan (and implementation of that plan, as needed) that includes some combination of household surveys, remote sensing (including drones), and crowdsourcing. Projects may also include guidance (and again, implementation of that guidance) on processing, storage, and cataloguing of all collected data.
>
> All Bank-produced datasets as part of the project can be hosted as a special collection on the World Bank's Data Catalogue, managed by the Development Economics Data Group (DECDG). The Catalogue receives more than 14 million unique users per month and will ensure value of the investment in data collection will be multiplied.
2. **Analytics**. Leveraging curated datasets, the team conducts analytics across a range of topics (e.g., understanding population movement in response to a crisis or monitoring trends in nighttime lights). Each analysis will include original code, documentation, links to original data sources (and/or information on how to access them), and a description of their limitations. Reference resources are also cited, where relevant.



3. **Additional Resources.** Links and descriptions of additional resources for each project may include:
- Description of common baseline data used to support the analyses -- administrative boundaries, population, infrastructure, etc.

- Project SharePoint where original data and documents are maintained.

- Additional static images and data visualizations.


4. **Project Team**. For each project, the [World Bank Data Lab](https://wbdatalab.org/) recruits colleagues from throughout the World Bank, pooling our collective data talents in support of our lending and technical assistance operations. Project packages include names and contact information for the unique teams that prepared the analytics.



## How Data Lab Projects are Managed

1. **Dynamic, Web-Hosted Documentation**. Unless specified otherwise, all code and documentation used to produce the analytics is hosted in a project GitHub repository to facilitate reuse for future updates and projects, as well as to support collaboration and capacity building activities.



2. **Data Catalogue**. Where possible, all datasets used in the production of Data Goods are added as entries to the World Bank’s [Development Data Hub](https://datacatalog.worldbank.org/home), where they are tagged with meta data, license attributes, and access information.



3. **Internal Project Management and File Sharing System**. To facilitate project management across teams, the Lab creates a Project SharePoint, which includes project management information (work plan, milestones, check-in slides, log of hours charged, final report), related literature, data files, indicator tables, and links to resources, such as this documentation. The advantage of SharePoint for World Bank usage is that all contents are automatically encrypted and tagged as Official Use Only. The project SharePoint is accessible to project team members and, with permission, can be replicated as a basis for future project updates or for similar projects.
4 changes: 2 additions & 2 deletions docs/_toc.yml
Original file line number Diff line number Diff line change
Expand Up @@ -2,9 +2,9 @@ format: jb-book
root: README

parts:
- caption: Introduction to Data Goods
- caption: Introduction to the Data Lab
chapters:
- file: docs/introduction_to_data_goods
- file: docs/1-intro-to-data-lab
- caption: Understanding Lebanon's Economy through Alternative Data
chapters:
- file: docs/foundational_datasets_and_data_products
Expand Down
6 changes: 0 additions & 6 deletions docs/foundational_datasets_and_data_products.md
Original file line number Diff line number Diff line change
Expand Up @@ -33,9 +33,3 @@ Following is a summary of Data Products used in this Data Good:
| B | Trade Estimation using AIS data | Estimating trade activity through ports to and from Syria using shipping data | [Trade Estimation data on project SharePoint](https://worldbankgroup.sharepoint.com.mcas.ms/teams/DevelopmentDataPartnershipCommunity-WBGroup/Shared%20Documents/Forms/AllItems.aspx?csf=1&web=1&e=Yvwh8r&cid=fccdf23e%2D94d5%2D48bf%2Db75d%2D0af291138bde&FolderCTID=0x012000CFAB9FF0F938A64EBB297E7E16BDFCFD&id=%2Fteams%2FDevelopmentDataPartnershipCommunity%2DWBGroup%2FShared%20Documents%2FProjects%2FData%20Lab%2FLebanon%20Economic%20Analytics%2FData%2Ftrade&viewid=80cdadb3%2D8bb3%2D47ae%2D8b18%2Dc1dd89c373c5) | | 1,5 |
| C | Changes in observed conflict | Analysing changes in conflict | [Processed ACLED data on SharePoint](https://worldbankgroup.sharepoint.com.mcas.ms/teams/DevelopmentDataPartnershipCommunity-WBGroup/Shared%20Documents/Forms/AllItems.aspx?csf=1&web=1&e=Yvwh8r&cid=fccdf23e%2D94d5%2D48bf%2Db75d%2D0af291138bde&FolderCTID=0x012000CFAB9FF0F938A64EBB297E7E16BDFCFD&id=%2Fteams%2FDevelopmentDataPartnershipCommunity%2DWBGroup%2FShared%20Documents%2FProjects%2FData%20Lab%2FLebanon%20Economic%20Analytics%2FData%2Fconflicts%2Facled&viewid=80cdadb3%2D8bb3%2D47ae%2D8b18%2Dc1dd89c373c5) | | 1,2,3 |
| D | Population and Demographics | Comparing multiple population related datasources | Analysis on the web book | | 1,6,8 |

## Sample Indicators

**Indicators** can be derived from a combination of **Foundational Datasets** and **Data Products**. By combining Foundational Datasets and Data Products, teams can, on-demand, develop a large array of indicators to meet their project needs. Indicators can presented side-by-side in an Excel workbook -- a format that is generally accessible to the widest audiences. Because all indicators are based on the same underlying data, they are comparable with each other, across geographies and across time.

For this project, sample indicators have been derived and aggregated at the governorate level and by year, to show changing trends in each governorate over time.
23 changes: 0 additions & 23 deletions docs/introduction_to_data_goods.md

This file was deleted.

22 changes: 11 additions & 11 deletions docs/team.md
Original file line number Diff line number Diff line change
@@ -1,15 +1,15 @@
# Data Goods Team and Acknowledgements
# Project Team and Acknowledgements

The Data Lab would like to express our sincere gratutude and appreciation for the colleagues who worked together to prepare this Data Goods package:

| **Name** | **Role** | **Team** |
| ---------------------------------------------------------- | ------------------------------------------------------------------------------- | ------------------ |
| [Holly Krambeck](mailto:[email protected]) | Team captain; documentation lead | WB Data Lab, DECDG |
| [Robert Andrew Marty](mailto:[email protected]) | Analyst - Nighttime lights analytics | DIME |
| [Gabriel Stefanini Vicente](mailto:[email protected]) | Data Scientist - Monitoring migration | WB Data Lab, DECDG |
| [Sahiti Sarva](mailto:[email protected]) | Data Scientist - Monioring conflict and demograohics; project data management | WB Data Lab, DECDG |
| [Andres Chamorro]([email protected]) | Geographer - Maritime trade analytics | GOST, DECDG |
| [Juan Ignacio Fulponi]([email protected]) | Data Scientist - Mobility and traffic analytics | WB Data Lab, DECDG |
| | | |
| **Name** | **Role** | **Team** |
| ---------------------------------------------------------- | ------------------------------------------------------------ | ------------------ |
| [Holly Krambeck](mailto:[email protected]) | Data Lab Program Manager | WB Data Lab, DECDG |
| [Robert Andrew Marty](mailto:[email protected]) | Data Scientist - Nighttime lights analytics | DIME |
| [Gabriel Stefanini Vicente](mailto:[email protected]) | Data Scientist - Monitoring migration | WB Data Lab, DECDG |
| [Sahiti Sarva](mailto:[email protected]) | Data Scientist - Monioring conflict, demographics, air pollution, movement, and aviation trends | WB Data Lab, DECDG |
| [Andres Chamorro](mailto: [email protected]) | Geographer - Maritime trade analytics | GOST, DECDG |
| | | |
| | | |

The Data Lab would also like to express our appreciation for [Luan Zhao](mailto:[email protected]), Senior Economist and Task Team Leader for the Lebanon economic analytical work. Luan has provided enormous support (and encouragement!) to the Lab team, working alongside us as we experiment with new data methods and modes of collaboration, every step of the way.
The Data Lab would also like to express our appreciation for [Luan Zhao](mailto:[email protected]), Senior Economist and Task Team Leader, as well as Naji Abou Hamde for the Lebanon economic analytical work. Luan has provided enormous support (and encouragement!) to the Lab team, working alongside us as we experiment with new data methods and modes of collaboration, every step of the way.

0 comments on commit 7a93bcd

Please sign in to comment.