Skip to content

aprilcoffee/how-to-make-human-machine-readable

 
 

Repository files navigation

Notebooks from the Seminar:

Human Machine Readable < WS21/22

Introduction into programming

Georg Trogemann, Christian Heck, Mattis Kuhn, Ting Chun Liu

Basic Seminar Material/Sculpture/Code

Compact seminar 11 - 4 pm | 31.01.2022 until 11.02.2022

Online @ BigBlueButton

Experimental Informatics

Academy of Media Arts Cologne

Email: [email protected], [email protected], [email protected]

Description

The generation of text by means of deep neural nets (NLG) has spread rapidly. Among other things, text-based dialog systems such as chatbots, assistance systems (Alexa/Siri) or robot journalism are increasingly used in news portals, e-commerce and social media; wherever context-based, natural language or reader-friendly texts are to be generated from structured data. Deep writing techniques have also found their way into the arts and literature with the help of models such as ELMo (Embeddings from Language Models), BERT (Bidirectional Encoder Representations from Transformers) or GPT-2/3 (Generative Pre-Training Transformer).

The goal of the seminar is that at the end each student has produced (a) text based on one of the neural language models mentioned above. No matter if poem, prose, novella, essay, manifesto, shopping list or social bot.

The course is intended as a general introduction to programming. It will not only teach skills to generate texts, but also the basics of Python, a universal programming language that can be used to program images, PDFs or web applications. Furthermore, Python is the most widely used language in programming Artificial Intelligences, especially Deep Neural Nets.

We ask for registration at [email protected] until 20.09.2021. No prior knowledge of programming is required to participate in the basic seminar.

Course

Week 1 (31.1. - 4.2.)

Hands on Jupyter Notebooks

Introduction

Hands on Python

Python: Variables

Python: Loops & Lists

Python: Booleans, If - Else, While-loop

Python: Strings, Files, Try & Except

Python: Functions

Python: Modules

Python: Tuples, Dictionaries, Set

Python: Class / OOP

Hands on Datasets

dataset-list < some resources of datasets & archives

scrape-load_textcorpora < some basic examples and code-snippets to srape, load and walk through datasets

scraper_wikipedia < extract text of specific wikipedia articles

clean_datasets

Coding books with Python

First book

Programmed books 2

Programmed books 3

Programmed books 4


Week 2 (7.2. - 11.2.)

Hands on Text as Data

0-order text generation < random word generation, wiederholung von Char, String and List

Data cleaning and Parsing < python method for parsing text as data

1-order text generation and Probability < probability calculation

Hands on Markov Chain

Markov Chain - Background and knowledge < basic knowledge of Markov chain

Markov Chain - Basic (Second Order Text Generation < Basic usage of Markov chain with second order text generation.

Markov Chain - N-order Text Generation < N-Order text generation.

Markov Chain - OOP < Markov Chain based on object oriented programming.

Markov Chain - Markovify-library < Markov Chain based on github repo https://github.com/jsvine/markovify

Additional - Markov Chain with Image Image Generation based on Markov Chain

Hands on Artificial Neural Networks (ANN)

ANN-in-Keras.ipynb < Dense Neural Network with Keras

+ working with Copilot

Hands on Recurrent Neural Networks (RNN) / Long Short Term Memory (LSTM) Networks

Text generation with LSTM < Text generation with RNN/LSTM

Hands on GPT-?

HuggingFace Pipeline < the HuggingFace way to use state-of-the-art NLP-models for inference

aitextgen < Python tool for text-based AI training and generation using GPT-2


General Info

Executing the Notebooks:

  • You can run, execute and work on the Notebooks in typing following button: Binder

Cheat Sheets

Title URL
Python Beginner Cheat Sheet https://github.com/ehmatthes/pcc/releases/download/v1.0.0/beginners_python_cheat_sheet_pcc_all.pdf
Markdown Syntax https://help.github.com/articles/basic-writing-and-formatting-syntax/
Jupyter Notebook https://cheatography.com/weidadeyue/cheat-sheets/jupyter-notebook/pdf_bw/
Conda https://docs.conda.io/projects/conda/en/latest/_downloads/843d9e0198f2a193a3484886fa28163c/conda-cheatsheet.pdf

Binder

About

Repo 4 basic seminar §How to make human machine readable"

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages

  • Jupyter Notebook 55.3%
  • HTML 44.7%