jspωiki

## Overview#

Bitmask (BIT STRING, Bitmap) is a method of Data representation using Binary

Bitmasks lend themselves to Bitwise operations

### MOTIVATION#

Suppose you have a set of objects and you want some way to represent which objects to pick and which ones not to pick.

How do you represent that in in a program? More generally, how do you represent a subest of a set?

One way is to use a Map to associate with each object a boolean value indicating whether the object is picked. Alternatively,if the object can be indexed by integers, you can use a boolean array. However, this takes up a lot of memory and can be slow due to the overhead of Map and array. If the size of the set is not too large, a bitmask is much more efficient (and convenient)!

Bitmask a.k.a. lightweight, small sets of Booleans (native support in C/C++/Java).

An integer is stored in a computer’s memory as a sequence/string of bits. Thus, we can use integers to represent a lightweight small set of Boolean values. All set operations then involve only the Bitwise operations of the corresponding integer, which makes it a much more efficient choice when compared with the C++ STL vector, bitset, or set options. Such speed is important in competitive programming.

We know an integer is just a bunch of bits stringed together. The 1st bit will represent whether the 1st object is picked, the 2nd bit will represent whether the 2nd object is picked or not, etc. For example, suppose in a set of 5 objects, we have picked the 1st , 3rd , and 4th object. The Bitmask to represent this in binary is 01101 or 13 in decimal (in the notes, the 1 st bit will always be the least significant bit and will always appear at the very right).

REPRESENTATION: A 32 (or 64)-bit Signed int for up to 32 (or 64) items. ( To avoid issues with the two’s complement representation, use a 32-bit/64-bit Signed int to represent bitmasks of up to 30/62 items only, respectively ).
```    For example:                          5| 4  | 3 | 2 | 1 | 0   <- 0-based indexing from right
32| 16 | 8 | 4 | 2 | 1   <- power of 2
A= 34 (base 10) =   1 | 0  | 0 | 0 | 1 | 0   <- in binary
F | E  | D | C | B | A   <- alternative alphabet label
```
In the example above,the integer A = 34 or 100010 in binary also represents a small set {1, 5} with a 0-based indexing scheme in increasing digit significance ( or {B, F} using the alternative alphabet label) because the second and the sixth bits (counting from the right) of A are on ( 1 ).

To multiply/divide an integer by 2: We only need to shift the bits in the integer left/right, respectively. Notice that the truncation in the shift right operation automatically rounds the division-by-2 down, e.g. 17/2 = 8.

```    For example:         A = 34 (base 10)                  = 100010 (base 2)
A = A << 1 = A * 2 = 68 (base 10) = 1000100 (base 2)
A = A >> 2 = A / 4 = 17 (base 10) = 10001 (base 2)
A = A >> 1 = A / 2 = 8 (base 10) = 1000 (base 2) <- LSB( Least Significant Bit )is gone
```

Add the jth object to the subset (set the jth bit from 0 to 1): use the bitwise OR operation A |= (1 << j).

```     For example:     A = 34 (base 10) = 100010 (base 2)
j = 3, 1 << j    = 001000 <- bit ‘1’ is shifted to the left 3 times
-------- OR (true if either of the bits is true)
A = 42 (base 10) = 101010 (base 2) // update A to this new value 42
```

Remove the jth object from the subset (set the jth bit from 1 to 0): use the bitwise AND operation A &= ∼(1 << j).

```     For example:         A = 42 (base 10) = 101010 (base 2)
j = 1, ~(1 << j) = 111101 <- ‘~’ is the [bitwise NOT] operation
-------- AND
A = 40 (base 10) = 101000 (base 2) // update A to this new value 40
```

Check whether the jth object is in the subset (check whether jth bit is 1): use the bitwise AND operation T = A & (1 << j).

• If T = 0, then the j-th item of the set is off.
• If T != 0 (to be precise, T = (1 << j)), then the j-th item of the set is on.
```   For example:    A = 42 (base 10) = 101010 (base 2)
j = 3, 1 << j    = 001000 <- bit ‘1’ is shifted to the left 3 times
-------- AND (only true if both bits are true)
T = 8 (base 10)  = 001000 (base 2) -> not zero, the 3rd item is on
```

To toggle (flip the status of) the j-th item of the set: use the Bitwise operation Bitwise XOR operation A ∧ = (1 << j).

```   For example:       A = 40 (base 10) = 101000 (base 2)
j = 2, (1 << j)  = 000100 <- bit ‘1’ is shifted to the left 2 times
-------- XOR <- true if both bits are different
A = 44 (base 10) = 101100 (base 2) // update A to this new value 44
```
To get the value of the least significant bit that is on (first from the right): use T = (A & (-A)).
```   For example:     A =  40 (base 10) = 000...000101000 (32 bits, base 2)
-A = -40 (base 10) = 111...111011000 (two’s complement)
----------------- AND
T =   8 (base 10) = 000...000001000 (3rd bit from right is on)
```

To turn on all bits in a set of size n: (be careful with overflows): use A = (1 << n) - 1 ;

Iterate through all subsets of a set of size n: for ( x = 0; x < (1 << n); ++x )

Iterate through all subsets of a subset y (not including empty set): for ( x = y; x > 0; x = ( y & (x-1) ) )

Example of a subset problem: given a set of numbers, we want to find the sum of all subsets.

Sol: This is easy to code using bitmasks. we can use an array to store all the results.

```int sum_of_all_subset ( vector< int > s ){
int n = s.size() ;
int results[ ( 1 << n ) ] ;     // ( 1 << n )= 2^n

//initialize results to 0
memset( results, 0, sizeof( results ) ) ;

// iterate through all subsets

for( int i = 0 ; i < ( 1 << n ) ; ++ i ) {    // for each subset, O(2^n)
for ( int j = 0; j < n ; ++ j ) {       // check membership, O(n)
i f ( ( i & ( 1 << j ) ) ! = 0 )    // test if bit ‘j’ is turned on in subset ‘i’?
results[i] += s [j] ;          // if yes, process ‘j’
}
}
}
```

### limitations#

Always check the size of the set to determine whether to use an int or long or not using Bitmask at all

Always use parentheses to indicate the precedence of operations when doing bitwise operations! When it involves bitwise operators and not putting parenthesis can yield undesirable results!

For example,

```let x = 5. Then x - 1 << 2 = 16, but x - (1 << 2) = 1
```