Link to LC notepad: https://pad.correlaid.org/CMrNZoCISn6_8UoI7fFStA?both#
Deadline: 10.09.2020 Weekly meetings on Tuesday evening Next meeting: September 1st, 56 pm)
-
✅ Comparison of accidents (maybe also traffic?) of different large cities
- Data:
datenguidepy
- Task: echarts4r implementation in Shiny app
- → Cédric
- Data:
-
Accidents and bike lanes in Berlin
- Data: Radverkehrslage + Unfallatlas 2019 (Tagesspiegel)
- Task: tmap implementation in Shiny app
- → Saleh / Cédric
-
Aggregation bike lane types
- Data: Radverkehrslage
- Task: Merge bike lane info to max. 6 categories (including "no bike lane")
- → Steffen
-
Accidents per bike lane type
- Data: Radverkehrslage + Unfallatlas 2019 (Tagesspiegel)
- Task: extract road id for each accident (
geosphere::dist2Line()
?) - → Andreas (Cédric as back-up)
-
Accidents/Risk per square
- Data: Radverkehrslage + Unfallatlas 2019 (Tagesspiegel)
- Task: Grid for Berlin (
sf::st_make_grid(data, cellsize = c(x, y))
) and counts of accidents per bike lane type (sf::st_join(points, grid, join = st_intersects)
) - → Steffen & Andreas
-
Texts accidents general
- Data:
datenguidepy
- Task: Paragraphs for general intro with regard to all road users all over Germany
- Content: Intro to topic, why accidents, what did we find in the comparison, what's exciting to look at in the following graph
- → Steffen
- Data:
-
Texts bikes in Berlin
- Data: Radverkehrslage + Unfallatlas
- Task: Paragraphs for bike accidents in Berlin
- Content: Why Berlin, what's new, link to bike lane improvements (many pictures) and survey (check also this article), why accidents, what did we find in the comparison, what's exciting to look at in the following graph
- → Steffen
-
Interesting title
- Task: Find title (and subtitle) that matches the project and is exciting
- → Steffen
-
Color choice
- Task: Fit colors to topic: less friendly (greenish → reddish? blueish?)
- → Cédric with feedback from others
-
Additional pictures?
- Task: Do we want pictures? Which?
- → EVERYONE
-
Additional plots?
- Task: Do we want other plots and if which? → YES, take-home-message!
- Summary stats per bike lane type
- Heatmap accidents Berlin?
- Summary stats per district?
- → EVERYONE
- Task: Do we want other plots and if which? → YES, take-home-message!
-
Comparison of accidents before and after bike lane improvement for selected roads
- Potential Problems:
- External effects (account for rush hour, season, etc.)
- Data - enough after improvement available? → Andreas
- Potential Problems:
-
Tool to find direction with lowest risk and highest proportion of bike lanes
- Potential Problems:
- too ambitious?
- Potential Problems:
-
Hot spot analysis (which factors play a role? Rush hour, road type, bike lane type, distance to city centre, public transport stations/hubs, ...)
- Potential Problems:
- Data - enough information for proper analysis
- Otherwise maybe only descriptive/informative :/
- Potential Problems:
datenguidepy
the dataset XY for comparison of big cities Germany-wide (or just Brandenburg?)- Unfallatlas data for detailled Berlin data
- Unfallschwerpunkte ('accident hotspots') by Berlin Police
- Traffic data for motorised vehicles from 2014
- Reports on bike accidents by Berlin Police (all reports under section "Radfahrer")
- Improved bike lanes in Berlin by InfraVelo with before-after pictures
- FixMyBerlin Survey Data (21,000 participants about risk perception of different bike lane setups)
Scripts:
- R script with searchable table of all stats by Cédric
- R script preparing Radverkehrsanlagen for Mapview by Andreas
- R script preparing Unfallatlas data by Cédric
- Explorative Python script by Lara
Resources general:
- A collection of (all?!) Berlin's spatial units including districts, district areas, LORs, traffic cells, corridors, and ZIP code areas
- BerlinOpenData for a range of (potentially) interesting data about Berlin
Resources accidents and bike lanes:
- Unfallatlas by Destatis
- CartoDB Berlin Map by Stefan Wehrmeyer
- FixMyBerlin Survey Data (21,000 participants about risk perception of different bike lane setups)
- Reports on accidents by Berlin Police
Other reports, studies, visualizations on accidents and bike Lanes:
- Tagesspiegel article on FixMyBike survey with some nice visualizations
- Tagesspiegel article from 2018 on bike lane quality
- Data via datenguidepy
- Have a look at the data
- Think about possible projects/topics
- Make yourself familiar with Python usage/data import etc.
→ Participants: Andreas, Cédric, Michael & Saleh
Ideas:
- Urban agriculture (Saleh)
- Green spaces (#/area) in Berlin (Michael)
- Bike lanes and car accidents (Andreas)
- Living versus non-living space & rental prices (Cédric)
- (Movement of young people to Berlin (East/West))
In general
- East/West comparison
- Comparison Germany-wide
Next steps:
- Explore the datasets in more detail via
datenguidepy
(own Rmd script, Jupyter notebook, Laras or Cédric's scripts...) - Search for additional resources (one requirement—or a "nice-to-have"—acoording to the rules are external data)
- Check news, articles, studies on that topic for...
- interesting findings/patterns/stories in other cities
- detailed reports for Berlin on the same topic (might either help or let's us discard the idea)
→ Participants: Andreas, Cédric, Lara & Saleh
-
Only Berlin-level data available for most (all?!) stats
- need for external data
- and/or comparison to Brandenburg (Gemeinden available) or other cities
-
Urban agriculture (Saleh)
- not yet any detaileld data or trends but many stats
- Saleh continues as a side project/plan B
-
Bike lanes and car accidents (Andreas)
- unfortunately only data up to 2018
- maybe something like Unfallatlas by Destatis or CartoDB Berlin Map by Stefan Wehrmeyer
- maybe combine with survey data on bike lane design by FixMyBerlin
-
Living versus non-living space & rental prices (Cédric)
- only Berlin-level data, often not many years and/or stable trends
-
Green spaces (#/area) in Berlin (Michael)
- not present
-
→ Team decision to focus on bike accidents/lanes in Berlin
- Unfallatlas with additional insights/information
- Ideas:
- combine with survey data?
- square mile representation of high/low risk areas?
- maybe combine with public transport information?
- check different street types, districts etc.
- Tools:
- interactive app? → Shiny/Leaflet/Plotly
- Static maps (square mile bubbles) →mnaybe inspired by Alberto López
- Data:
- Overview charts and/or comparison with other cities via datenguidepy
- Detailed maps of accidents via data from destatis ("Unfallatlas")
Next Steps:
- Discuss how we maybe could include survey dats (Cédric & Lara + everyone interested, mainly on Slack)
- Explore data sources (Andreas)
- Check first stats: bike accidents (we only focus on bikes now right?) per district, road type, time, .... (summaries not maps) (Cédric + anyone who is interested)
- Explore urban agriculture data (Saleh)
→ Participants: Andreas, Cédric & Saleh
- Accidents of cyclist in relation to bike lane types
- Part 1: Overall comparison between cities Germany wide to include
datenguidepy
- Part 2: Static or interactive map of bike lane type and accidents + hotspots
- Part 1: Overall comparison between cities Germany wide to include
- Still to fix some issues with Python and detailled data
- Do we really need to digitalize the reports from the Berlin Police Department? → Let's try to use the Unfallatlas data first
- Please upload all scripts and output on GitHub and actively engage in discussions on Slack
- Discuss main topic - what do we answer with our analysis and vis?
- Static or interactive? (depends mostly on time left)
- Do we need more data?
- Explore data availability of bike accidents per year for different cities (largest cities in Germany and/or Brandenburg) with
datenguidepy
→ Saleh - Visualize the findings (e.g. bar chart or slope chart of cases per x inhabitants per city over time) → Saleh (if you need help/have no time don't hesitate to let us know)
- Import and investigate Unfallatlas data → Cédric
- Visualize temporal and spatial patterns → Cédric
- Explore data availability and mergeability for before-after bike lane improvement (main focus: Unfallatlas - if not: PDF? Other BerlinOpenData Portal? other?) → Andreas (if you need help/some discussion don't hesitate to let us know)
- Digitalize data of bike lane quality before/after plus filtered data for these streets/areas → Andreas
→ 5:30 pm via Zoom
→ Participants: Andreas, Cédric, Saleh & Steffen
- Accidents of cyclist in relation to bike lane types
- Part 1: Overall comparison between cities Germany wide to include
datenguidepy
- Part 2: Static or interactive map of bike lane type and accidents + hotspots
- Part 1: Overall comparison between cities Germany wide to include
- Andreas:
- Shiny map showing bike lane types in Berlin works locally but problems on server
- Still searching for a way/someone to digitalize the Police reports → very coarse so we decided for now to NOT use this source anyway
- To say anything about how good or bad specific bike lane types are we need to acocunt for several factors → actually time of the day and year may not be a problem since we have the same data quality for all areas butg wse should definitely account for traffic
- So far we only have traffic daat for motorised vehicles from 2014; some including bikes (and maybe even pedestriants) would be nice!
- If we find proper treaffic data, we need to find the road for each geolocation in the Unfallatlas data to match the accident with the bike lane type
- If we can't accoutn for traffic we anyway can only show aboslute nubmers and no details are probably needed (only for interaoctive so people can filter by different bike lane type?)
- Cédric:
- Saleh:
- Problems with saving csv solved → still would be cool to get reticulate to work so we can access the data directly from the
datenguidypy
package! - First example visualizatiosn for Saarland on GitHub
- We need data for largest cities not federal states → decided to use 5 largest cities by population to compare with Berlin; this includes 6 different federal states in different directions of Germany as well as the two most populated cities (Munich and Berlin) (Wiki link)
- Problems with saving csv solved → still would be cool to get reticulate to work so we can access the data directly from the
- Main goal: Comparison of accidents (maybe also traffic?) of different large cities
- Problem:
datenguidepy
provides these stats but each combination of stat and city has to be queried. To work efficiently and visulaize the data all together, we need all relvant stats for chosen cities in one dataframe. - Task: Create one final dataset with all 6 cities and several statistics in one dataframe (script and data file, e.g. as Rds, on GitHub)
- Stats: accidents with injuries per x cars/x inhabitants and year
- Cities: 5 largest cities with regard to population besides Berlin in Germany: Berlin, Hamburg, Munich, Cologne, Frankfurt (Main) and Stuttgart)
→ Saleh, with the help of Cédric if needed
- Main goal: Comparison of accidents (maybe also traffic?) of different large cities
- Task: Visualizations of summary stats as comparison between cities and years (script and report/images on GitHub)
→ Saleh & Cédric
- Main goal: Comparison of accidents per x road users for different areas/bike lane types
- Problem: Unfallatlas provides absolute numbers—however, if we want to compare different bike lane types we need somehow to acocunt for the traffic (#accidents / #cyclist). We have some traffic data on motorised vehicles from 2014 (link) but more detailed data on all types of road users would be good
- Task: Search for other traffic data for Berlin, in the best case recent and including bikes
→ Andreas & Cédric, with feedback from others
- Main goal: Comparison of accidents per x road users for different areas/bike lane types
- Problem: The Unfallatlas data contains geolocations (lat/long) while the bike lane data is based on roads
- Task: Match the geolocations to the (closest) road and extract number of acidents per road, area, and bike lane type.
→ Andreas, with the help of Cédric if needed
- Main goal: Comparison of accidents per x road users for different areas/bike lane types
- Task: Depending on the suitable traffic data we either
- show simply bike lane types and absolute numbers of accidents (thus no need to match geolocations to rads) + hotspots + (number of accidents per district?)
- show relative risk of bike accidents + additional charts comparing relative rik per bike lane type, district, ...
- Match the geolocations to the (closest) road and extract number of acidents per road, area, and bike lane type.
→ Andreas & Cédric
→ 6 pm via Zoom
→ Participants: Andreas, Cédric, Saleh & Steffen
-
Andreas:
- upload location data
- matching locations to lines
- calculate summaries per bike lanes
- add rasterize Berlin + summarize data
-
Cédric:
- interactive line chart
- {tmap}
- matching locations to lines (back-up)
-
Saleh:
- {tmap}
-
Steffen:
- aggregate bike lane types
- text paragraphs
- add rasterize Berlin + summarize data