diff --git a/README.md b/README.md index 283e350ab..9e32d2c28 100644 --- a/README.md +++ b/README.md @@ -2,7 +2,7 @@ :information_source:  This repo contains questions and exercises on various technical topics, sometimes related to DevOps and SRE :) -:bar_chart:  There are currently **1840** questions +:bar_chart:  There are currently **1899** questions :books:  To learn more about DevOps and SRE, check the resources in [devops-resources](https://github.com/bregman-arie/devops-resources) repository @@ -785,7 +785,24 @@ For example, you might configure the workflow to trigger every time a changed is
-In Git
+True or False? In GitHub Actions, jobs are executed in parallel by default
+ +True +
+ +
+How to create dependencies between jobs so one job runs after another?
+ +Using the "needs" attribute/directive. + +``` +jobs: + job1: + job2: + needs: job1 +``` + +In the above example, job1 must complete successfully before job2 runs
@@ -900,6 +917,16 @@ Read more about auto scaling [here](https://aws.amazon.com/autoscaling) False. Auto scaling adjusts capacity and this can mean removing some resources based on usage and performances.
+#### Cloud - Security + +
+How to secure instances in the cloud?
+ + * Instances should have only the minimal permissions needed. You don't want an instance-level incident to become an account-level incident + * Instances should be accessed through load balancers or bastion hosts. In other words, they should be off the internet (in a private subnet behind a NAT). + * Use the latest OS images for your instances (or at least apply the latest patches) +
+ ## AWS ### AWS Exercises @@ -1433,6 +1460,14 @@ Learn more about it [here](https://aws.amazon.com/compliance/shared-responsibili What is the AWS compliance program?
+
+How to secure instances in AWS?
+ + * Instance IAM roles should have only the minimal permissions needed. You don't want an instance-level incident to become an account-level incident + * Use "AWS Systems Manager Session Manager" for SSH + * Use the latest OS images for your instances +
+
What is AWS Artifact?
@@ -5948,7 +5983,7 @@ True
-Using the 'latest' tag when pulling an image means, you are pulling the most recently published image
+True or False? Using the 'latest' tag when pulling an image means you are pulling the most recently published image
False. While this might be true in some cases, it's not guaranteed that you'll pull the latest published image when using the 'latest' tag.
For example, in some images, 'edge' tag is used for the most recently published images. @@ -6076,6 +6111,20 @@ Look for "Cmd" or "Entrypoint" fields in the output of `docker image inspec
+
+What is the role of cache in image builds?
+ +When you build an image for the first time, its layers are cached. So, while the first build of the image might take time, any subsequent build of the same image (provided that neither the Dockerfile nor the content used by its instructions changed) will be much faster thanks to the build cache. + +In a little more detail, it works this way: +1. The first instruction (FROM) will check if the base image already exists on the host before pulling it +2. For the next instruction, it will check in the build cache whether an existing layer was built from the same base image using the same instruction + 1. If it finds such a layer, it skips the instruction, links the existing layer and keeps using the cache. + 2. If it doesn't find a matching layer, it builds the layer and the cache is invalidated for all the following instructions. + +Note: in some cases (like COPY and ADD instructions) the instruction might stay the same, but if the content being copied has changed then the cache is invalidated. This check is done by comparing the checksum of each file that is being copied. +
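The cache lookup described above can be sketched in a few lines of Python. This is only an illustration of the idea, not how Docker implements its build cache: `layer_key`, `build` and the in-memory `cache` set are hypothetical names, and each layer's key is assumed to be a hash of its parent's key, the instruction text and (for COPY/ADD) the checksums of the copied files.

```python
import hashlib
from typing import Dict, List, Optional, Set, Tuple

Step = Tuple[str, Optional[Dict[str, bytes]]]  # (instruction, files copied by it)


def layer_key(parent_key: str, instruction: str,
              files: Optional[Dict[str, bytes]] = None) -> str:
    """Hypothetical cache key: parent layer + instruction + file checksums."""
    h = hashlib.sha256()
    h.update(parent_key.encode())
    h.update(b"\0")
    h.update(instruction.encode())
    for path in sorted(files or {}):
        # For COPY/ADD the content matters, not only the instruction text
        h.update(b"\0")
        h.update(path.encode())
        h.update(hashlib.sha256(files[path]).digest())
    return h.hexdigest()


def build(steps: List[Step], cache: Set[str]) -> List[str]:
    """Return 'CACHED' or 'BUILT' for every step, updating the cache."""
    results = []
    parent = ""
    for instruction, files in steps:
        key = layer_key(parent, instruction, files)
        if key in cache:
            results.append("CACHED")
        else:
            cache.add(key)
            results.append("BUILT")
        # Every key includes its parent's key, so one miss changes all
        # later keys as well: the cache is invalidated from this point on.
        parent = key
    return results


steps = [
    ("FROM python:3.12", None),
    ("COPY app.py /app/", {"app.py": b"print(1)"}),
    ("RUN echo done", None),
]
cache: Set[str] = set()
first = build(steps, cache)    # nothing cached yet
second = build(steps, cache)   # same Dockerfile, same content
changed = [steps[0], ("COPY app.py /app/", {"app.py": b"print(2)"}), steps[2]]
third = build(changed, cache)  # same instructions, different file content
```

In this sketch the third build reuses only the `FROM` layer: the `COPY` instruction text is unchanged, but the file checksum differs, so that layer and everything after it is rebuilt.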
+
What ways are there to reduce container images size?
@@ -6085,6 +6134,17 @@ Look for "Cmd" or "Entrypoint" fields in the output of `docker image inspec
+
+What are the pros and cons of squashing images?
+ +Pros: + * Smaller image + * Reduced number of layers (especially if the image has a lot of layers) +Cons: + * No sharing of the image layers + * Push and pull can take more time (because no matching layers are found on the target) +
+ #### Containers - Volume
@@ -6102,7 +6162,7 @@ Different container engines (e.g. Docker, Podman) can build images automatically
-What is the first line in all Dockefiles and what does it mean?
+What is the first instruction in all Dockerfiles and what does it mean?
The first instruction is `FROM `
It specifies the base layer of the image to be used. Every other instruction is a layer on top of that base image. @@ -6126,6 +6186,13 @@ It specifies the base layer of the image to be used. Every other instruction is * Do not use environment variables to share secrets * Use images from official repositories * Keep images small! - you want them only to include what is required for the application to run successfully. Nothing else. + * If you are using the apt package manager, you might use `--no-install-recommends` with `apt-get install` to install only the main dependencies (without the recommended packages) +
+ +
+What is the "build context"?
+ +[Docker docs](https://docs.docker.com/engine/reference/commandline/build): "A build’s context is the set of files located in the specified PATH or URL"
@@ -6365,21 +6432,6 @@ Create a new image from a container’s changes Via the local socket at `/var/run/docker.sock`
-
-Explain what is Docker compose and what is it used for
- -Compose is a tool for defining and running multi-container Docker applications. With Compose, you use a YAML file to configure your application’s services. Then, with a single command, you create and start all the services from your configuration. - -For example, you can use it to set up ELK stack where the services are: elasticsearch, logstash and kibana. Each running in its own container. -
- -
-Describe the process of using Docker Compose

- -* Define the services you would like to run together in a docker-compose.yml file -* Run `docker-compose up` to run the services -
-
Explain Docker interlock
@@ -6413,6 +6465,24 @@ Because each container has its own writable container layer, and all changes are How do you copy files from Docker container to the host and vice versa?
+#### Containers - Docker Compose + +
+Explain what Docker Compose is and what it is used for
+ +Compose is a tool for defining and running multi-container Docker applications. With Compose, you use a YAML file to configure your application’s services. Then, with a single command, you create and start all the services from your configuration. + +For example, you can use it to set up an ELK stack where the services are Elasticsearch, Logstash and Kibana, each running in its own container.
+In general, it's useful for running applications which are composed of several different services. It lets you manage them as one deployed app, instead of multiple separate services. +
+ +
+Describe the process of using Docker Compose

+ +* Define the services you would like to run together in a docker-compose.yml file +* Run `docker-compose up` to run the services +
+ #### Containers - Docker Images
@@ -6469,10 +6539,52 @@ By default, Docker uses everything (all the files and directories) in the direct `.dockerignore` used for excluding files and directories from the build context
+#### Containers - Networking + +
+What container network standards or architectures are you familiar with?
+ +CNM (Container Network Model): + * Requires a distributed key-value store (etcd, for example) for storing the network configuration + * Used by Docker +CNI (Container Network Interface): + * Network configuration should be in JSON format +
+ +#### Containers - Docker Networking + +
+What network specification does Docker use and what is its implementation called?
+ +Docker uses the CNM (Container Network Model) design specification.
+Docker's implementation of the CNM specification is called "libnetwork". It's written in Go. +
+ +
+Explain the following blocks with regard to CNM: + + * Networks + * Endpoints + * Sandboxes
+ + * Networks: software implementation of a switch. They are used for grouping and isolating a collection of endpoints. + * Endpoints: Virtual network interfaces. Used for making connections. + * Sandboxes: Isolated network stack (interfaces, routing tables, ports, ...) +
+ #### Containers - Security
-A container can cause a kernel panic and bring down the whole host. What preventive actions can you apply to avoid it?
+What security best practices are there regarding containers?
+ * Install only the necessary packages in the container + * Don't run containers as root when possible + * Don't mount the Docker daemon unix socket into any of the containers + * Set volumes and container's filesystem to read only + * DO NOT run containers with the `--privileged` flag +
+ +
+A container can cause a kernel panic and bring down the whole host. What preventive actions can you apply to avoid this specific situation?
* Install only the necessary packages in the container * Set volumes and container's filesystem to read only @@ -8382,6 +8494,20 @@ Or directly on the command line: `helm install --set some_key=some_value` Helm allows you to upgrade, remove and rollback to previous versions of charts. In version 2 of Helm it was with what is known as "Tiller". In version 3, it was removed due to security concerns.
+#### Kubernetes - Security + +
+What best practices do you follow with regard to the Kubernetes cluster?
+ + * Secure inter-service communication (one way is to use Istio to provide mutual TLS) + * Isolate different resources into separate namespaces based on some logical groups + * Use a supported container runtime (if you rely on Docker Engine as the runtime, plan to move away from it, since Kubernetes deprecated its dockershim integration. You might want to use CRI-O as an engine and Podman for the CLI) + * Properly test changes to the cluster (e.g. consider using Datree to prevent Kubernetes misconfigurations) + * Limit who can do what in the cluster (for example, by using OPA Gatekeeper) + * Use NetworkPolicy to apply network security + * Consider using tools (e.g. Falco) for monitoring threats +
+ #### Submariner
@@ -11673,6 +11799,10 @@ Running parallel and high-performance computing applications #### Azure - Network +
+What Azure network services are you familiar with?
+
+
What's an Azure region?
@@ -11683,6 +11813,10 @@ Running parallel and high-performance computing applications #### Azure Storage +
+What Azure storage services are you familiar with?
+
+
What storage options Azure supports?
@@ -13337,18 +13471,14 @@ It's an architecture in which data is and retrieved from a single, non-shared, s * Browser cache * Operating system cache * The DNS server configured on the user's system (can be ISP DNS, public DNS, ...) - 2. If it couldn't find a DNS record locally, a full DNS resolution is started. - 3. It connects to the server using the TCP protocol - 4. The browser sends an HTTP request to the server - 5. The server sends an HTTP response back to the browser - 6. The browser renders the response (e.g. HTML) - 7. The browser then sends subsequent requests as needed to the server to get the embedded links, javascript, images in the HTML and then steps 3 to 5 are repeated. + +TODO: add more details!
#### API @@ -13396,6 +13526,18 @@ While automation focuses on a task level, Orchestration is the process of automa What is a Debuggger and how it works?
+
+What services might an application have?
+ + * Authorization + * Logging + * Authentication + * Ordering + * Front-end + * Back-end + ... +
+
What is Metadata?
@@ -14054,6 +14196,18 @@ Not only this will tell you what is expected from you, it will also provide big ## Databases +|Name|Topic|Objective & Instructions|Solution|Comments| +|--------|--------|------|----|----| +| Message Board Tables | Relational DB Tables | [Exercise](exercises/databases/table_for_message_board_system.md) | [Solution](exercises/databases/solutions/table_for_message_board_system.md) + +
+What is a relational database?
+ + * Data Storage: system to store data in tables + * SQL: programming language to manage relational databases + * Data Definition Language: a standard syntax to create, alter and delete tables +
+
What does it mean when a database is ACID compliant?
@@ -14151,6 +14305,53 @@ A connection leak is a situation where database connection isn't closed after be A database index is a data structure that improves the speed of operations in a table. Indexes can be created using one or more columns, providing the basis for both rapid random lookups and efficient ordering of access to records.
+
+What data types are there in relational databases?
+
+ +
+Explain Normalization
+ +Data that is used multiple times in a database should be stored once and referenced with a foreign key.
+This has the clear benefit of ease of maintenance where you need to change a value only in a single place to change it everywhere. +
+ +
+Explain Primary Key and Foreign Key
+ +Primary Key: each row in every table should have a unique identifier that represents the row.
+Foreign Key: a reference to another table's primary key. This allows you to join tables together to retrieve all the information you need without duplicating data. +
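As a minimal sketch of both key types, the following uses Python's built-in `sqlite3` module with an in-memory database. The `people` and `messages` tables and their columns are illustrative assumptions (loosely modelled on the message board exercise), not a required schema.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
# SQLite enforces foreign keys only when this pragma is enabled
conn.execute("PRAGMA foreign_keys = ON")

# Primary key: a unique identifier for every row
conn.execute(
    "CREATE TABLE people (id INTEGER PRIMARY KEY, first_name TEXT NOT NULL)"
)
# Foreign key: people_id must reference an existing people.id
conn.execute(
    """CREATE TABLE messages (
           id INTEGER PRIMARY KEY,
           people_id INTEGER NOT NULL REFERENCES people(id),
           message TEXT NOT NULL
       )"""
)

conn.execute("INSERT INTO people (id, first_name) VALUES (1, 'Dana')")
conn.execute("INSERT INTO messages (people_id, message) VALUES (1, 'hello')")

# Join the tables through the key instead of duplicating the person's data
rows = conn.execute(
    """SELECT people.first_name, messages.message
       FROM messages JOIN people ON people.id = messages.people_id"""
).fetchall()

# A message pointing at a non-existent person is rejected
try:
    conn.execute("INSERT INTO messages (people_id, message) VALUES (99, 'orphan')")
    fk_rejected = False
except sqlite3.IntegrityError:
    fk_rejected = True
```

The person's name is stored once in `people`. Messages only carry the `people_id` reference, so renaming a person requires a change in a single row.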
+ +
+What types of data tables have you used?
+ + * Primary data table: main data you care about + * Details table: includes a foreign key and has a one-to-many relationship + * Lookup values table: can be one table per lookup or a table containing all the lookups and has a one-to-many relationship + * Multi reference table +
+ +
+What is ORM? What benefits does it provide with regard to relational database usage?
+ +[Wikipedia](https://en.wikipedia.org/wiki/Object%E2%80%93relational_mapping): "is a programming technique for converting data between incompatible type systems using object-oriented programming languages" + +With regard to relational databases: + + * Database as code + * Database abstraction + * Encapsulates SQL complexity + * Enables code review process + * Enables usage as a native OOP structure +
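To make the "encapsulates SQL complexity" and "native OOP structure" points concrete, here is a small hand-written mapper built on Python's standard library. It is a sketch of the idea only: real ORM libraries generate this kind of code for you, and `Person` and `PersonRepository` are hypothetical names chosen for the example.

```python
import sqlite3
from dataclasses import dataclass
from typing import Optional


@dataclass
class Person:
    """Plain object the application works with."""
    id: int
    first_name: str
    last_name: str


class PersonRepository:
    """Hand-written mapping between Person objects and a 'people' table."""

    def __init__(self, conn: sqlite3.Connection) -> None:
        self.conn = conn
        self.conn.execute(
            "CREATE TABLE IF NOT EXISTS people ("
            "id INTEGER PRIMARY KEY, first_name TEXT, last_name TEXT)"
        )

    def add(self, person: Person) -> None:
        # Object -> row
        self.conn.execute(
            "INSERT INTO people (id, first_name, last_name) VALUES (?, ?, ?)",
            (person.id, person.first_name, person.last_name),
        )

    def get(self, person_id: int) -> Optional[Person]:
        # Row -> object (or None when no such row exists)
        row = self.conn.execute(
            "SELECT id, first_name, last_name FROM people WHERE id = ?",
            (person_id,),
        ).fetchone()
        return Person(*row) if row else None


repo = PersonRepository(sqlite3.connect(":memory:"))
repo.add(Person(1, "Dana", "Levi"))
found = repo.get(1)
missing = repo.get(2)
```

Callers deal only with `Person` objects and never write SQL themselves, which is the abstraction an ORM provides at a much larger scale.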
+ +
+What is DDL?
+ +[Wikipedia](https://en.wikipedia.org/wiki/Data_definition_language): "In the context of SQL, data definition or data description language (DDL) is a syntax for creating and modifying database objects such as tables, indices, and users." +
+ ## Regex Given a text file, perform the following exercises diff --git a/exercises/databases/solutions/table_for_message_board_system.md b/exercises/databases/solutions/table_for_message_board_system.md new file mode 100644 index 000000000..dece83fe5 --- /dev/null +++ b/exercises/databases/solutions/table_for_message_board_system.md @@ -0,0 +1,46 @@ +## Database Table for Message Board System + +### Instructions + +Design a database table for a message board system. It should include the following information: + + * Personal details + * Who saw the message and when + * Replies + * Tagged people in the message + * Message categories + +Notes: + + * No SQL is needed + * You should include: table names, field names, data types and mention the foreign keys used. + +### Solution + +Note: This is just one possible design +2nd Note: PK = primary key, FK = Foreign key + + ----- People ----- + ID int PK + FirstName varchar(255) + LastName varchar(255) + DOB date + Gender varchar(1) + Phone varchar(10) + + | \ + | \ + | \ + v \ + \ + --- Messages --- v + ID int PK + MessageBoardID FK --- MessageTags --- +--- MessageBoards --- PeopleID int FK ID int PK +ID int PK ----> MsgDate datetime ---> MessageID FK +Board text Message text PeopleID int Fk + MessageID (FK) + ^ | + | | + |______| + diff --git a/exercises/databases/table_for_message_board_system.md b/exercises/databases/table_for_message_board_system.md new file mode 100644 index 000000000..9ba3f256f --- /dev/null +++ b/exercises/databases/table_for_message_board_system.md @@ -0,0 +1,16 @@ +## Database Table for Message Board System + +### Instructions + +Design a database table for a message board system. It should include the following information: + + * Personal details + * Who saw the message and when + * Replies + * Tagged people in the message + * Message categories + +Notes: + + * No SQL is needed + * You should include: table names, field names, data types and mention the foreign keys used.