Skip to content

Commit

Permalink
Adds style guide and naming conventions (#93)
Browse files Browse the repository at this point in the history
Adds a style guide for CN style and naming conventions to distinguish between
CN identifiers and C identifiers.

There will be changes to the style guide shortly.  See the PR for more details:
#84

Co-authored-by: Liz Austell <[email protected]>
  • Loading branch information
2 people authored and yav committed Oct 15, 2024
1 parent f284ac2 commit 7c0a52c
Show file tree
Hide file tree
Showing 146 changed files with 2,698 additions and 1,904 deletions.
5 changes: 5 additions & 0 deletions Makefile
Original file line number Diff line number Diff line change
Expand Up @@ -3,6 +3,7 @@
MAKEFILE_DIR:=$(dir $(realpath $(lastword $(MAKEFILE_LIST))))

default: build exercises build/tutorial.html build/exercises.zip
all: default

clean:
rm -rf build TAGS
Expand Down Expand Up @@ -43,6 +44,10 @@ build/solutions/%: src/examples/%
build/exercises.zip: $(EXERCISES)
cd build; zip -r exercises.zip exercises > /dev/null

WORKING=$(wildcard src/examples/list_*.c)
WORKING_AUX=$(patsubst src/examples/%, build/solutions/%, $(WORKING))
temp: $(WORKING_AUX) build
# build/tutorial.html

##############################################################################
# Check that the examples all run correctly
Expand Down
105 changes: 105 additions & 0 deletions NAMING-CONVENTIONS.md
Original file line number Diff line number Diff line change
@@ -0,0 +1,105 @@
CN Naming Conventions
---------------------

This document describes our (Benjamin and Liz's) current best shot at
a good set of conventions for naming identifiers in CN, based on
numerous discussions and worked examples. Everything in the tutorial
(in src/examples) follows these conventions. Future CN coders are
encouraged to follow suit.

# Principles

- When similar concepts exist in both C and CN, they should be named
so that the correspondence is immediately obvious.
- In particular, the C and CN versions of a given data structure
should have very similar names.

- In text, we use the modifiers _CN-level_ vs. _C-level_ to
distinguish the two worlds.

# General conventions

## For new code

When writing both C and CN code from scratch (e.g., in the tutorial),
aim for maximal correspondence between

- In general, identifiers are written in `snake_case` (or
`Snake_Case`) rather than `camlCase` (or `CamlCase`).

- C-level identifiers are `lowercase` wherever possible.

- CN-level identifiers are `Uppercase_Consistently_Throughout`.

- A CN identifier that represents the state of a mutable data
structure after some function returns should be named the same as
the starting state of the data structure, with an `_post` at the
end.
- E.g., The list copy function takes a linked list `l`
representing a sequence `L` and leaves `l` at the end pointing
to a final sequence `L_post` such that `L == L_post`.
(Moreover, it returns a new sequence `Ret` with `L == Ret`.)

- Predicates that extract some structure from the heap should be named
the same as the structure they extract, plus the suffix `_At`.
E.g., the result type of the `Queue` predicate is also called
`Queue_At`.

## For existing code

In existing C codebases, uppercase-initial identifiers are often used
for typedefs, structs, and enumerations. We should choose a
recommended standard convention for such cases -- e.g., "prefix
CN-level identifiers with `CN` when needed to avoid confusion with
C-level identifiers". Some experimentation will be needed to see
which convention we like best; this is left for future discussions.

# Built-ins

This proposal may ultimately suggest changing some built-ins for
consistency.

- `i32` should change to `I32`, `u64` to `U64`
- `is_null` to `Is_null` (or `Is_Null`)

*Discussion*: One point against this change is that CN currently tries
to use names reminiscent of Rust (`i32`, `u64`, etc.). I (BCP) do not
personally find this argument persuasive -- internal consistency seems
more important than miscellaeous points of similarity with some other
language. One way or the other, this will require a global decision.

# Polymorphism

One particularly tricky issue is how to name the "monomorphic
instances" of "morally polymorphic" functions (i.e., whether to write
`append__Int` or `append__List_Int` rather than just `append`). On
one hand, `append__Int` is "more correct". On the other hand, these
extra annotations can get pretty heavy.

We propose a compromise:

1. If a project needs to use two or more instances of some polymorphic
type, then the names of the C and CN types, the C and CN functions
operating over them, and the CN predicates describing them are all
suffixed with `__xxx`, where `xxx` is the appropriate "type
argument". E.g., if some codebase uses lists of both signed and
unsigned 32-bit ints, then we would use names like this:
- `list__int` / `list__uint`
- `append__int` / `append__uint`
- `List__I32` / `List__U32`
- `Cons__I32` / `Cons__U32`
- etc.

2. However, if, in a given project, a set of "morally polymorphic"
type definitions and library functions is *only used at one
monomorphic instance* (e.g., if some codebase only ever uses lists
of 32-bit signed ints), then the `__int` or `__I32` annotations are
omitted.

This convention is used in the CN tutorial, for example.

*Discussion*: One downside of this convention is that it might
sometimes require some after-the-fact renaming: If a project starts
out using just lists of signed ints and later needs to introduce lists
of unsigned ints, the old signed operations will need to be renamed.
This seems like an acceptable cost for keeping things light.
2 changes: 1 addition & 1 deletion src/example-archive/SAW/broken/error-cerberus/00001.swap.c
Original file line number Diff line number Diff line change
Expand Up @@ -75,7 +75,7 @@ let argmin_spec len = do {
// NOTE: Another mistake I made was to use the override below.
// This works for size a_len, but on the second time through the loop the override
// doesn't match because the size of the list has changed! Ultimately need a
// doesn't match because the size of the ./headers.has changed! Ultimately need a
// loop here.
{-/
Expand Down
6 changes: 3 additions & 3 deletions src/example-archive/simple-examples/working/malloc_1.c
Original file line number Diff line number Diff line change
Expand Up @@ -3,19 +3,19 @@
// malloc() is not defined by default in CN. We can define a fake malloc()
// function that only works on ints.

int *my_malloc_int()
int *my_malloc__int()
/*@ trusted; @*/
/*@ ensures take New = Block<int>(return); @*/
{}

int *malloc_1()
int *malloc__1()
/*@ ensures
take New = Owned<int>(return);
New == 7i32;
*return == 7i32; @*/ // <-- Alternative syntax
{
int *new;
new = my_malloc_int();
new = my_malloc__int();
*new = 7; // Have to initialize the memory before it's owned
return new;
}
9 changes: 0 additions & 9 deletions src/examples/3.c

This file was deleted.

35 changes: 0 additions & 35 deletions src/examples/Dbl_Linked_List/add.c

This file was deleted.

27 changes: 0 additions & 27 deletions src/examples/Dbl_Linked_List/add_orig.broken.c

This file was deleted.

5 changes: 0 additions & 5 deletions src/examples/Dbl_Linked_List/c_types.h

This file was deleted.

6 changes: 0 additions & 6 deletions src/examples/Dbl_Linked_List/cn_types.h

This file was deleted.

22 changes: 0 additions & 22 deletions src/examples/Dbl_Linked_List/getters.h

This file was deleted.

12 changes: 0 additions & 12 deletions src/examples/Dbl_Linked_List/malloc_free.h

This file was deleted.

17 changes: 0 additions & 17 deletions src/examples/Dbl_Linked_List/node_and_int.h

This file was deleted.

36 changes: 0 additions & 36 deletions src/examples/Dbl_Linked_List/predicates.h

This file was deleted.

36 changes: 0 additions & 36 deletions src/examples/Dbl_Linked_List/remove.c

This file was deleted.

13 changes: 0 additions & 13 deletions src/examples/Dbl_Linked_List/singleton.c

This file was deleted.

Loading

0 comments on commit 7c0a52c

Please sign in to comment.