CorrelAid X Challenge Berlin

Link to LC notepad: https://pad.correlaid.org/CMrNZoCISn6_8UoI7fFStA?both#

Deadline: 10.09.2020. Weekly meetings on Tuesday evening (next meeting: September 1st, 5–6 pm)


Next Steps

Visualizations

  • ✅ Comparison of accidents (maybe also traffic?) of different large cities

    • Data: datenguidepy
    • Task: echarts4r implementation in Shiny app
    • → Cédric
  • Accidents and bike lanes in Berlin

    • Data: Radverkehrslage + Unfallatlas 2019 (Tagesspiegel)
    • Task: tmap implementation in Shiny app
    • → Saleh / Cédric

Geodata Processing

  • Aggregation bike lane types

    • Data: Radverkehrslage
    • Task: Merge bike lane info to max. 6 categories (including "no bike lane")
    • → Steffen
  • Accidents per bike lane type

    • Data: Radverkehrslage + Unfallatlas 2019 (Tagesspiegel)
    • Task: extract road id for each accident (geosphere::dist2Line()?)
    • → Andreas (Cédric as back-up)
  • Accidents/Risk per square

    • Data: Radverkehrslage + Unfallatlas 2019 (Tagesspiegel)
    • Task: Grid for Berlin (sf::st_make_grid(data, cellsize = c(x, y))) and counts of accidents per bike lane type (sf::st_join(points, grid, join = st_intersects))
    • → Steffen & Andreas
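The grid task above (cells over Berlin, then accident counts per cell and bike lane type) can be sketched without any geo library. This is only an illustration of the binning logic, not the planned `sf` implementation: it assumes projected coordinates in metres, and the sample points, lane type labels and cell size are made up.

```python
from collections import Counter
from math import floor


def grid_cell(x, y, x_min, y_min, cell_w, cell_h):
    """Return the (column, row) index of the grid cell containing point (x, y)."""
    return (floor((x - x_min) / cell_w), floor((y - y_min) / cell_h))


def count_per_cell(accidents, x_min, y_min, cell_w, cell_h):
    """Count accidents per (grid cell, bike lane type) combination."""
    counts = Counter()
    for x, y, lane_type in accidents:
        cell = grid_cell(x, y, x_min, y_min, cell_w, cell_h)
        counts[(cell, lane_type)] += 1
    return counts


# Made-up accident points: (x in m, y in m, hypothetical lane type label)
accidents = [
    (120.0, 80.0, "lane_a"),
    (130.0, 90.0, "lane_a"),
    (620.0, 80.0, "none"),
    (510.0, 40.0, "lane_a"),
]

# 500 m x 500 m cells, grid origin at (0, 0); both values are assumptions
counts = count_per_cell(accidents, x_min=0.0, y_min=0.0, cell_w=500.0, cell_h=500.0)
for (cell, lane_type), n in sorted(counts.items()):
    print(cell, lane_type, n)
```

Each accident lands in exactly one cell, so summing the counts over all cells gives back the total number of accidents, which is a quick sanity check for the real data as well.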

App Design

  • Texts accidents general

    • Data: datenguidepy
    • Task: Paragraphs for general intro with regard to all road users all over Germany
    • Content: Intro to topic, why accidents, what did we find in the comparison, what's exciting to look at in the following graph
    • → Steffen
  • Texts bikes in Berlin

    • Data: Radverkehrslage + Unfallatlas
    • Task: Paragraphs for bike accidents in Berlin
    • Content: Why Berlin, what's new, link to bike lane improvements (many pictures) and survey (check also this article), why accidents, what did we find in the comparison, what's exciting to look at in the following graph
    • → Steffen
  • Interesting title

    • Task: Find title (and subtitle) that matches the project and is exciting
    • → Steffen
  • Color choice

    • Task: Fit colors to topic: less friendly (greenish → reddish? blueish?)
    • → Cédric with feedback from others
  • Additional pictures?

    • Task: Do we want pictures? Which?
    • → EVERYONE
  • Additional plots?

    • Task: Do we want other plots and, if so, which? → YES, take-home message!
      • Summary stats per bike lane type
      • Heatmap accidents Berlin?
      • Summary stats per district?
    • → EVERYONE

Potential Topics:

  • Comparison of accidents before and after bike lane improvement for selected roads

    • Potential Problems:
      • External effects (account for rush hour, season, etc.)
      • Data: is enough available after improvement? → Andreas
  • Tool to find direction with lowest risk and highest proportion of bike lanes

    • Potential Problems:
      • too ambitious?
  • Hot spot analysis (which factors play a role? Rush hour, road type, bike lane type, distance to city centre, public transport stations/hubs, ...)

    • Potential Problems:
      • Data: is there enough information for a proper analysis?
      • Otherwise maybe only descriptive/informative :/

Data

Resources

Scripts:

Resources general:

Resources accidents and bike lanes:

Other reports, studies, visualizations on accidents and bike lanes:




Meetings

Meeting July 21st

Next steps

  • Have a look at the data
  • Think about possible projects/topics
  • Familiarize yourself with Python usage/data import etc.

Meeting July 28th

→ Participants: Andreas, Cédric, Michael & Saleh

Ideas:

  • Urban agriculture (Saleh)
  • Green spaces (#/area) in Berlin (Michael)
  • Bike lanes and car accidents (Andreas)
  • Living versus non-living space & rental prices (Cédric)
  • (Movement of young people to Berlin (East/West))

In general

  • East/West comparison
  • Comparison Germany-wide

Next steps:

  • Explore the datasets in more detail via datenguidepy (own Rmd script, Jupyter notebook, Lara's or Cédric's scripts...)
  • Search for additional resources (according to the rules, external data is a requirement, or at least a "nice-to-have")
  • Check news, articles, studies on that topic for...
    • interesting findings/patterns/stories in other cities
    • detailed reports for Berlin on the same topic (might either help or let us discard the idea)

Meeting August 4th

→ Participants: Andreas, Cédric, Lara & Saleh

  • Only Berlin-level data available for most (all?!) stats

    • need for external data
    • and/or comparison to Brandenburg (Gemeinden available) or other cities
  • Urban agriculture (Saleh)

    • no detailed data or trends yet, but many stats
    • Saleh continues as a side project/plan B
  • Bike lanes and car accidents (Andreas)

  • Living versus non-living space & rental prices (Cédric)

    • only Berlin-level data, often not many years and/or stable trends
  • Green spaces (#/area) in Berlin (Michael)

    • not present
  • → Team decision to focus on bike accidents/lanes in Berlin

    • Unfallatlas with additional insights/information
    • Ideas:
      • combine with survey data?
      • square mile representation of high/low risk areas?
      • maybe combine with public transport information?
      • check different street types, districts etc.
    • Tools:
    • Data:
      • Overview charts and/or comparison with other cities via datenguidepy
      • Detailed maps of accidents via data from destatis ("Unfallatlas")

Next Steps:

  • Discuss how we could include survey data (Cédric & Lara + everyone interested, mainly on Slack)
  • Explore data sources (Andreas)
  • Check first stats: bike accidents (we only focus on bikes now, right?) per district, road type, time, ... (summaries, not maps) (Cédric + anyone who is interested)
  • Explore urban agriculture data (Saleh)

Meeting August 11th

→ Participants: Andreas, Cédric & Saleh

Main Idea:

  • Accidents of cyclists in relation to bike lane types
    • Part 1: Overall Germany-wide comparison between cities to include datenguidepy
    • Part 2: Static or interactive map of bike lane types and accidents + hotspots

General Updates:

  • Some issues with Python and the detailed data still need to be fixed
  • Do we really need to digitize the reports from the Berlin Police Department? → Let's try to use the Unfallatlas data first
  • Please upload all scripts and output on GitHub and actively engage in discussions on Slack

Open questions:

  • Discuss main topic - what do we answer with our analysis and vis?
  • Static or interactive? (depends mostly on time left)
  • Do we need more data?

Next Steps:

  • Explore data availability of bike accidents per year for different cities (largest cities in Germany and/or Brandenburg) with datenguidepy → Saleh
  • Visualize the findings (e.g. bar chart or slope chart of cases per x inhabitants per city over time) → Saleh (if you need help/have no time don't hesitate to let us know)
  • Import and investigate Unfallatlas data → Cédric
  • Visualize temporal and spatial patterns → Cédric
  • Explore data availability and mergeability for before-after bike lane improvement (main focus: Unfallatlas - if not: PDF? Other BerlinOpenData Portal? other?) → Andreas (if you need help/some discussion don't hesitate to let us know)
  • Digitize data of bike lane quality before/after plus filtered data for these streets/areas → Andreas

Meeting August 18th

→ 5:30 pm via Zoom

→ Participants: Andreas, Cédric, Saleh & Steffen

Main Idea (same as before):

  • Accidents of cyclists in relation to bike lane types
    • Part 1: Overall Germany-wide comparison between cities to include datenguidepy
    • Part 2: Static or interactive map of bike lane types and accidents + hotspots

General Updates:

  • Andreas:
    • Shiny map showing bike lane types in Berlin works locally but problems on server
    • Still searching for a way/someone to digitize the police reports → very coarse, so we decided for now NOT to use this source anyway
    • To say anything about how good or bad specific bike lane types are, we need to account for several factors → time of day and year may actually not be a problem since we have the same data quality for all areas, but we should definitely account for traffic
    • So far we only have traffic data for motorised vehicles from 2014; data including bikes (and maybe even pedestrians) would be nice!
    • If we find proper traffic data, we need to find the road for each geolocation in the Unfallatlas data to match the accident with the bike lane type
    • If we can't account for traffic, we can only show absolute numbers anyway, and details are probably not needed (only for an interactive version so people can filter by bike lane type?)
  • Cédric:
    • Unfallatlas data available for everyone on GitHub, first visualizations of trends
    • For Berlin, there are unfortunately only 2 years (2018 and 2019) with roughly 5,000 accidents involving bikes per year
  • Saleh:
    • Problems with saving csv solved → it would still be cool to get reticulate to work so we can access the data directly from the datenguidepy package!
    • First example visualizations for Saarland on GitHub
    • We need data for the largest cities, not federal states → decided to use the 5 largest cities by population to compare with Berlin; this includes 6 different federal states in different directions of Germany as well as the two most populated cities (Munich and Berlin) (Wiki link)

Detailed next steps:

✦ Collect accident data for several cities with datenguidepy

  • Main goal: Comparison of accidents (maybe also traffic?) of different large cities
  • Problem: datenguidepy provides these stats, but each combination of stat and city has to be queried separately. To work efficiently and visualize the data all together, we need all relevant stats for the chosen cities in one dataframe.
  • Task: Create one final dataset with all 6 cities and several statistics in one dataframe (script and data file, e.g. as Rds, on GitHub)
    • Stats: accidents with injuries per x cars/x inhabitants and year
    • Cities: Berlin plus the 5 next-largest German cities by population: Hamburg, Munich, Cologne, Frankfurt (Main) and Stuttgart

→ Saleh, with the help of Cédric if needed
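One way to assemble the single dataset described above is to loop over every city/statistic combination and append the results to one long table. The sketch below only illustrates that loop: `fetch_stat` is a hypothetical stand-in for a single query (it is not a datenguidepy function), the statistic labels are invented, and the returned values are dummy zeros rather than real figures.

```python
import csv
import io

CITIES = ["Berlin", "Hamburg", "Munich", "Cologne", "Frankfurt (Main)", "Stuttgart"]
STATS = ["accidents_with_injuries", "inhabitants"]  # hypothetical labels


def fetch_stat(city, stat):
    """Hypothetical stand-in for one query per city and statistic.

    Returns {year: value}; the zeros are dummy placeholders, not real data.
    """
    return {2018: 0.0, 2019: 0.0}


def build_long_table(cities, stats, fetch):
    """Query every city/stat combination and stack the results in long format."""
    rows = []
    for city in cities:
        for stat in stats:
            for year, value in sorted(fetch(city, stat).items()):
                rows.append({"city": city, "stat": stat, "year": year, "value": value})
    return rows


rows = build_long_table(CITIES, STATS, fetch_stat)

# Write the combined table as CSV (to an in-memory buffer here; a file works the same way)
buffer = io.StringIO()
writer = csv.DictWriter(buffer, fieldnames=["city", "stat", "year", "value"])
writer.writeheader()
writer.writerows(rows)
print(buffer.getvalue().splitlines()[0])
```

A long format (one row per city, statistic and year) keeps the table easy to filter and reshape later, whichever plotting tool ends up being used.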

✦ Visualize accident data for several cities

  • Main goal: Comparison of accidents (maybe also traffic?) of different large cities
  • Task: Visualizations of summary stats as comparison between cities and years (script and report/images on GitHub)

→ Saleh & Cédric

✦ Find traffic data to estimate rates

  • Main goal: Comparison of accidents per x road users for different areas/bike lane types
  • Problem: Unfallatlas provides absolute numbers. However, to compare different bike lane types we need to account for traffic somehow (#accidents / #cyclists). We have some traffic data on motorised vehicles from 2014 (link), but more detailed data on all types of road users would be good
  • Task: Search for other traffic data for Berlin, in the best case recent and including bikes

→ Andreas & Cédric, with feedback from others
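To make the point about absolute numbers concrete, here is a minimal sketch of the rate calculation (#accidents / #cyclists, scaled per 1,000 road users). The lane type labels and all counts are invented purely for illustration; the function name and the `per` parameter are assumptions as well.

```python
def rate_per(accidents, road_users, per=1000):
    """Accidents per `per` road users; None if no exposure data is available."""
    if road_users <= 0:
        return None
    return accidents * per / road_users


# Invented numbers: lane type A has more accidents in absolute terms,
# but far more cyclists, so its relative risk is lower.
observed = {
    "lane_a": {"accidents": 30, "cyclists": 60000},
    "lane_b": {"accidents": 10, "cyclists": 5000},
}

rates = {
    lane: rate_per(values["accidents"], values["cyclists"])
    for lane, values in observed.items()
}
print(rates)
```

With these made-up figures the ranking flips once exposure is taken into account, which is exactly why traffic data matters for comparing bike lane types.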

✦ Merge geodata with roads (only if suitable traffic data available)

  • Main goal: Comparison of accidents per x road users for different areas/bike lane types
  • Problem: The Unfallatlas data contains geolocations (lat/long) while the bike lane data is based on roads
  • Task: Match the geolocations to the (closest) road and extract number of accidents per road, area, and bike lane type.

→ Andreas, with the help of Cédric if needed
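The matching step can be illustrated with a plain point-to-segment distance: for each accident, check every segment of every road and keep the closest one. This is a simplified sketch under the assumption of planar (projected) coordinates in metres, not geodesic distances on lat/long as `geosphere::dist2Line()` would compute; the road ids and coordinates are made up.

```python
from math import hypot


def point_segment_distance(p, a, b):
    """Shortest planar distance from point p to the segment a-b."""
    px, py = p
    ax, ay = a
    bx, by = b
    dx, dy = bx - ax, by - ay
    seg_len_sq = dx * dx + dy * dy
    if seg_len_sq == 0:  # degenerate segment: a and b coincide
        return hypot(px - ax, py - ay)
    # Projection of p onto the line through a and b, clamped to the segment
    t = ((px - ax) * dx + (py - ay) * dy) / seg_len_sq
    t = max(0.0, min(1.0, t))
    return hypot(px - (ax + t * dx), py - (ay + t * dy))


def nearest_road(point, roads):
    """Return (road_id, distance) of the road polyline closest to point."""
    best_id, best_dist = None, float("inf")
    for road_id, vertices in roads.items():
        for a, b in zip(vertices, vertices[1:]):
            d = point_segment_distance(point, a, b)
            if d < best_dist:
                best_id, best_dist = road_id, d
    return best_id, best_dist


# Made-up road polylines (lists of vertices) keyed by a hypothetical road id
roads = {
    "road_1": [(0.0, 0.0), (100.0, 0.0)],
    "road_2": [(0.0, 50.0), (100.0, 50.0), (100.0, 150.0)],
}

road_id, dist = nearest_road((40.0, 10.0), roads)
print(road_id, dist)
```

Checking every segment is fine for a sketch, but with thousands of accidents and roads a spatial index would likely be needed. It may also be worth recording the distance alongside the road id so that implausibly distant matches can be filtered out.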

✦ Visualize accidents (absolute or relative)

  • Main goal: Comparison of accidents per x road users for different areas/bike lane types
  • Task: Depending on whether suitable traffic data is available, we either
    • simply show bike lane types and absolute numbers of accidents (thus no need to match geolocations to roads) + hotspots + (number of accidents per district?)
    • or show relative risk of bike accidents + additional charts comparing relative risk per bike lane type, district, ...
    • Match the geolocations to the (closest) road and extract number of accidents per road, area, and bike lane type.

→ Andreas & Cédric

Next Meeting: August 25th

→ 6 pm via Zoom

→ Participants: Andreas, Cédric, Saleh & Steffen

  • Andreas:

    • upload location data
    • matching locations to lines
    • calculate summaries per bike lanes
    • add rasterize Berlin + summarize data
  • Cédric:

    • interactive line chart
    • {tmap}
    • matching locations to lines (back-up)
  • Saleh:

    • {tmap}
  • Steffen:

    • aggregate bike lane types
    • text paragraphs
    • add rasterize Berlin + summarize data

Next Meeting: September 2nd