types a type consists of –a set of values –a set of operations on those values types can be...

Post on 21-Dec-2015

220 Views

Category:

Documents

0 Downloads

Preview:

Click to see full reader

TRANSCRIPT

Types

• A type consists of– a set of values– a set of operations on those values

• Types can be– primitive (atomic, non-decomposable)– composite (includes recursive types)

• Types can be– built-in– user-defined

Primitive types

• Typical primitive types– Boolean = {true,false}– Character = {…,’a’,…’z’,…’0’,…,’9’,…}– Integer = {0, 1, -1, 2, -2, … }– Float = {0.0, …, 1.0, -1.0, …, 3.1415, …}

• Set of primitive types is language-dependent• Value of primitive types typically

implementation-dependent, though can be written into language spec (e.g. Java, C#)

Gotcha’s

• Boolean is not always a distinct type– C++ bools are really ints

• Character is not always a distinct type– C, C++, Java are really just (unsigned) ints

• Languages sometimes provide many variants on a primitive type:– Java’s Integer types: byte, short, int, long

User-defined primitives

• Enumerations allow programmers to create their own primitive (non-decomposable) types:– C++

enum day {Sunday, Monday, Tuesday, Wednesday, Thursday, Friday, Saturday};

Monday prints as 1

– Javaenum Day {Sunday, Monday, Tuesday, Wednesday, Thursday, Friday, Saturday}

Monday prints as Mondaymethods can be attached to enumerands

Composite types

• Cartesian products– tuples, records, structures

• disjoint unions– union, discriminated or variant records

• mappings– arrays, functions

• recursive types– lists, trees

Cartesian products

• S x T = {(x,y) | x in S, y in T}

• Basic operations:– construction of a pair– selection of a component

• Cardinality (#) of a Cartesian product:– #(S x T) = #S * #T

• Can be extended naturally beyond pairs to general n-tuples

C structures

enum Month {jan, feb, mar, apr, may, jun, jul, aug, sep, oct, nov, dec};

struct Date {

Month m;

int d;

};

Date today = {mar,6};

Pascal records

month = (jan, feb, mar, apr, may, jun, jul, aug, sep, oct, nov, dec);

date = record

m: month;

d: 1..31;

end;

today : date;

today.m := mar;

today.d := 6;

ML records

Form of record:{ <label>:<type>, <label>:<type>, ... <label>:<type> }

Accessing a member: #<label>(<record>)

datatype month = jan | feb | mar | apr | may | jun | jul | aug | sep | oct | nov | dec;

val today = { m=mar, d=6 };

ML tuples

• A tuple in ML is a special type of record, whose field labels are integers starting from 1.

• (“fred”,false) is a tuple whose type is reported as string * bool. This is underlyingly the record { 1=“fred”, 2=false }, but the (…) are special syntactic sugar.

Tuples and records in ML

• val a = (“fred”, false) has type string*bool.

• val b = {1=“fred”, 2=false) has type string*bool, and is printed as (“fred”,false).

• val c = {one=“fred”, two=false} has type {one:string, two:bool} and is printed as {one=“fred”, two=false}.

Making a date in ML

datatype month = jan | feb | mar | apr | may | jun | jul | aug | sep | oct | nov | dec;

datatype date = Date of month * int;

val today = Date(mar,6);

Date is a new type which is built using a tuple, a special type of record.

Accessing members of a datatype

val today = Date(mar,6);

val Date(x,y) = today;

This binds x to mar and y to 6!

Homogeneous tuples

• What are they?– S2 = S x S– Sn = S x S x … x S

• Cardinalities– #(S2)= (#S)2

– #(Sn)= (#S)n

– #(S0)= (#S)0=1

• What is S0? – It is the type Unit, whose value is ().– Unit corresponds to void in C, C++ and Java.

Mappings

• Types of mappings:– Arrays– Functions

• Arrays map integers to the declared type of the variables in the array.

• Functions map an arbitrary domain type to an arbitrary range type.

Mappings in general

• S = {0,1}

• T = {a,b,c}

• How many mappings ST are there?

• #(ST) = #(T)#(S)

• The possible ST mappings are:

{0a,1a}, {0a,1b}, {0a,1c}, {0b,1a}, {0b,1b}, {0b,1c}, {0c,1a}, {0c,1b}, {0c,1c}

Implementing a finite mapping as an array

T[] map = {a,c};

defines map to be the third mapping given on previous slide:

map[0] = a

map[1] = c

Implementing a finite mapping as a function

We can also define the mapping as a function (ML example):fun map 0 = a | map 1 = c;

or (in Pascal): function map(s:S):T;

begin case (s) of

0: map:=a;1: map:=c;

end;

More discussionWe can also define the mapping as a function (ML example):

fun map x = case x of

0 => a1 => c

(Notice function syntax in case expression!)

In C:T map(int x) {

switch (x) { case 0: return a; case 1: return c; }

}

More general finite mappings

Hash tables let us define more general finite mappings (domain type does not need to be integral).

Memoization

• Combination of computation and storage!

Array-based vs. function-based mappings

• An array-based mapping is explicit:– it gives domain-range pairs explicitly

• A function-based mapping is implicit:– for each domain element it gives a method for

computing the corresponding range element.

Disjoint Unions

• Pascal: variant record

• C/C++: union

• ML: union type

• Ada: discriminated record

Disjoint Unions(quick review)

• Pascal: variant record

• C/C++: union

• ML: union type

• Ada: discriminated record

What is a disjoint union?

• “Another kind of composite value is the disjoint union, whereby a value is chosen from one of several (usually different) sets.”

David Watt

Programming Language Design Concepts page 27

S+T

Values are tagged to indicate which set they are from:

S+T = { left(x) | x in S } { right(y) | y in T }

Cardinality? #(S+T) = #S + #T

MLdatatype number = Exact of int | Inexact of real;

datatype person = King

| Peer of string*string*int

| Knight of string

| Peasant of string;

Values: King, Peer(“Earl”, “Carlisle”, 7),

Knight(“Gawain”), Peasant(“Jack Cade”)

Paulson, ML for the Working Programmer

pages 124-125

OO disjoint unions(new material)

• The main purpose of a disjoint union is to bring together values from different sets in a way that you can determine from which set a given value is drawn.

• This type of structure is used very commonly in OO programming.

• Polymorphism feeds off of this idea: implicit selection based on type.

OO disjoint union example

Disjoint union

Recursive types

• A type defined in terms of itself• Examples: lists, strings, trees, …• If R is a recursive type, #(R) is infinite.• Members of a recursive type are often finite,

tough infinite members are possible (the list of all primes is a list, but it is an infinite list).

• How? They are implicitly represented (think of function vs. array representation of mappings): e.g. lazy lists.

Recursive type membership• Consider the list type:datatype ’a List = Nil

| Cons of ’a * ’a List;

• What are the members of this type?List(0) = {Nil} (lists of length zero)

List(1) = {Cons(x,y)| y in List(0)} (lists of length one)

…List(k) = {Cons(x,y)| y in List(k-1)} (lists of length k)

List = Un=0…∞ List(n) (lists of all lengths, zero or greater)

C++ & pointer types

• Languages like C++ do not directly support recursive types, but rather require the use of an indirection mechanism, the pointer.

• This has to do with the semantics of assignment. If v is a recursive structure, what are reasonable semantics for w=v?– copy semantics – expensive – reference semantics – sharing

• If language does not permit selective updating of a structure (as in a pure functional language) the semantics are indistinguishable).

Type equivalence

Type equivalence addresses the question of when two types are considered equivalent.

– Structural equivalence• two types are equivalent if and only if they have

the same set of values

– Name equivalence• two types are equivalent if and only if they are

defined in the same place

Structural equivalence

• Types S and T are equivalent iff– S and T are both primitive and S and T are identical– S=AxB and T=CxD are both cartesian products and A

and C are equivalent and B and D are equivalent– S=AB and T=CD are both mappings and A and C

are equivalent and B and D are equivalent– S=A+B and T=C+D are both disjoint unions and A

and C are equivalent and B and D are equivalent

• Otherwise, S and T are not equivalent.

Name equivalence

• Strictly, types are unique – each new type definition occurs in a different place, and so results in a new distinct type.

• Pascal technically uses name equivalence, but in practice the rules are relaxed:– For example, suppose we wanted to define a file type

to permit two programs to communicate via a file. Since the file type is defined in two different places (in two different programs) their files types are considered different (and hence incompatible).

Name vs. Structural equivalence

• Name equivalence forces each distinct type to be defined in one and only one place. This is sometimes inconvenient, but it helps to make the program more maintainable. (If the same type is defined in several places, and subsequently it has to be changed, the change must be made consistently in several places.)

• Structural equivalence allows confusion between types that are only coincidentally similar.

Watt, page 42

Type Completeness Principle

“No operation should be arbitrarily restricted in the types of its operands.”

Watt, page 43Types whose values are unrestricted in their usage are termed “first-class”, whereas those that are restricted are termed “second-class”.In Pascal procedures can be passed as arguments to other procedures, but cannot, for example, be part of a composite value. Pascal procedures are therefore second class.A language like ML imposes no such arbitrary type distinctions. All types are first-class.

top related