Parking-space-allocation

This project automatically detects whether a vehicle is parked in a parking spot and counts the number of vacant and occupied spaces.

Since this is a classic object detection problem, I chose a pretrained model from the Detectron2 model zoo to build a vanilla baseline. The model is trained on the COCO dataset, which means it is ready to detect cars, buses and trucks out of the box.

Installation

The entire project can be deployed on a local server. I have written a REST API in Flask for it. To use the model:

  1. Download the repository.

  2. Install dependencies from `requirements.txt`

  3. Fire up a terminal inside the repo and run `python app.py`

You are good to go. Upload an image and you'll see the results.

Todo

  1. Add documentation for automatic detection of parking spaces

  2. Add vectorised operations for calculating IOU

The code for the above two items has already been written and tested in the `parking_space_allocation.ipynb` Jupyter notebook; only the explanation and documentation remain.

Note to the readers

You are invited to propose changes, contribute, or collaborate on this project.

Approach

Detecting vehicles is no longer a challenge, especially with off-the-shelf object detection models. Since cars are already present in the COCO dataset, we don't even need to fine-tune the model on custom classes. Once we detect a vehicle, we measure the amount of overlap between the parking spot and the car; if the overlap is above a threshold, we conclude that a vehicle is parked in that spot.

The challenge is to define what exactly a parking spot is. How can a model detect a parking spot? Different parking lots have different types of parking spots. Can there be a solution that detects parking spots in any parking lot, and if so, what would the features of a parking spot be?

We will come to these questions later.

For now, we are going to manually label all the parking spots in the image.

Data

There is a huge dataset (approx. 10 GB) of parking lot images on Kaggle, but we don't need that much data for our problem. The baseline solution we are creating has two constraints:

  1. For every parking lot, we have to manually draw boxes around parking spots.
  2. All the test images of the parking lot must be from a single stationary camera.

Here is the link to the dataset: Parking Lot dataset

There are three parking lots in the dataset. I chose one of them.

Next in line is labelling the parking spots. We could manually label the parking spots in the image with a labelling tool, and I was about to do just that when I found that each image in the dataset comes with a label file in XML format. Each image has 40 predefined parking spots; depending on whether a car is parked there or not, each spot is labelled as vacant or occupied, along with the coordinates of the contour that covers it.

I am more comfortable with JSON when it comes to labels and metadata, so I converted the XML labels to JSON. You can do that online on Code beautifier.
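If you would rather script the conversion, here is a minimal sketch using the `xmltodict` package (an assumption on my part, not something this repo documents); `attr_prefix='_'` matches the `_x`/`_y` keys accessed in the code below, and the file paths are hypothetical:

import json
import xmltodict

# hypothetical paths: convert one XML label file to JSON
with open('labels/sample.xml') as f:
  # attr_prefix='_' turns XML attributes into keys like '_x' and '_y'
  data = xmltodict.parse(f.read(), attr_prefix='_')
with open('labels/sample.json', 'w') as f:
  json.dump(data, f, indent=2)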

I wrote the function below to extract all the polygons bounding the parking spots and draw them on an image.

import codecs
import json

import cv2

def generate_label_polygons(img_path, label_path):
  # the labels are UTF-8 JSON files converted from the original XML
  data = json.load(codecs.open(label_path, 'r', 'utf-8-sig'))
  img = cv2.imread(img_path)
  parking_spaces = data['parking']['space']
  for parking_space in parking_spaces:
    coordinates = parking_space['contour']['point']
    npts = []
    for item in coordinates:
      x, y = int(item['_x']), int(item['_y'])
      npts.append([x, y])
    img = draw_poly(img, npts)  # small drawing helper, sketched below
  return img
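The `draw_poly` helper isn't reproduced in this README (it lives in the notebook). A minimal sketch, assuming it simply traces the closed contour with `cv2.polylines`, could be:

import numpy as np

def draw_poly(img, npts, color=(0, 255, 0), thickness=2):
  # cv2.polylines expects int32 points shaped (num_points, 1, 2)
  pts = np.asarray(npts, dtype=np.int32).reshape(-1, 1, 2)
  return cv2.polylines(img, [pts], isClosed=True, color=color, thickness=thickness)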

(Figure: parking-spot polygons drawn on a sample lot image)

One interesting thing to notice is that the labels are irregular quadrilaterals, whereas object detection models predict rectangular boxes. So, if we are going to calculate the overlap between the two, we need to turn the parking spots into rectangles.

def generate_label_bboxes_img(img_path, label_path):
  img = cv2.imread(img_path)
  data = json.load(codecs.open(label_path, 'r', 'utf-8-sig'))
  parking_spaces = data['parking']['space']
  for parking_space in parking_spaces:
    coordinates = parking_space['contour']['point']
    xs = []
    ys = []
    for item in coordinates:
      x, y = int(item['_x']), int(item['_y'])
      xs.append(x)
      ys.append(y)
    # the axis-aligned bounding box of the quadrilateral
    xmin, ymin, xmax, ymax = min(xs), min(ys), max(xs), max(ys)
    img = cv2.rectangle(img, (xmin, ymin), (xmax, ymax), (0, 255, 0), 2)
  return img

Here is how it looks:

(Figure: parking spots converted to rectangular bounding boxes, drawn in green)

The transformation looks great, but instead of returning an image we need to store these boxes in an object and return that.

import numpy as np
import torch

def generate_label_bboxes(label_path):
  data = json.load(codecs.open(label_path, 'r', 'utf-8-sig'))
  parking_spaces = data['parking']['space']
  bbox = []
  for parking_space in parking_spaces:
    coordinates = parking_space['contour']['point']
    xs = []
    ys = []
    for item in coordinates:
      x, y = int(item['_x']), int(item['_y'])
      xs.append(x)
      ys.append(y)
    xmin, ymin, xmax, ymax = min(xs), min(ys), max(xs), max(ys)
    bbox.append([xmin, ymin, xmax, ymax])
  # stack all boxes into a single (num_spots, 4) tensor
  bbox_arr = np.asarray(bbox)
  torch_bbox = torch.tensor(bbox_arr)
  return torch_bbox

It now returns a PyTorch tensor `torch_bbox` containing all the parking space boxes.
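As a quick sanity check (the label path is hypothetical): with the 40 predefined spots per image described above, the tensor should have shape (40, 4):

torch_bbox = generate_label_bboxes('labels/sample.json')  # hypothetical path
print(torch_bbox.shape)  # expected: torch.Size([40, 4])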

Next, we go ahead and detect cars with our model.

Model

We use FAIR's (Facebook AI Research) Detectron2 API for our problem. It offers a number of pretrained models; I am using a Faster R-CNN architecture in this project. First, we load the model with the function below.

from detectron2 import model_zoo
from detectron2.config import get_cfg
from detectron2.engine import DefaultPredictor

def setup_model():
  cfg = get_cfg()
  cfg.merge_from_file(model_zoo.get_config_file("COCO-Detection/faster_rcnn_X_101_32x8d_FPN_3x.yaml"))
  # Faster R-CNN reads its test-time thresholds from ROI_HEADS
  cfg.MODEL.ROI_HEADS.NMS_THRESH_TEST = 0.5  # a lower value suppresses more redundant boxes
  cfg.MODEL.ROI_HEADS.SCORE_THRESH_TEST = 0.5
  cfg.MODEL.WEIGHTS = model_zoo.get_checkpoint_url("COCO-Detection/faster_rcnn_X_101_32x8d_FPN_3x.yaml")
  predictor = DefaultPredictor(cfg)
  return predictor, cfg

It returns a predictor to which we can pass NumPy images directly.

Let's write a function to visualize the predictor's output:

from detectron2.data import MetadataCatalog
from detectron2.utils.visualizer import ColorMode, Visualizer

def visualize_preds(outputs, cfg, im):
  # the Visualizer expects RGB while OpenCV loads BGR, hence the [:, :, ::-1] flips
  v = Visualizer(im[:, :, ::-1],
                 MetadataCatalog.get(cfg.DATASETS.TRAIN[0]),
                 scale=1.0,
                 instance_mode=ColorMode.SEGMENTATION)
  v = v.draw_instance_predictions(outputs["instances"].to("cpu"))
  img_out = v.get_image()[:, :, ::-1]
  return img_out

The above code is a thin wrapper around the Detectron2 API. It returns a NumPy image with boxes drawn around the detected objects. We call this function and display the image:

img = cv2.imread(img_path)
predictor, cfg = setup_model()
outputs = predictor(img)
img_out = visualize_preds(outputs, cfg, img)
cv2.imshow('predictions', img_out)  # cv2.imshow needs a window name
cv2.waitKey(0)

(Figure: Detectron2 predictions for all COCO classes drawn on the parking lot image)

The results look great, but as of now the model predicts all the classes in the COCO dataset, while we want only vehicles. Let's assume our use case asks us to detect only parked cars, yet the model is predicting trucks too.

So, we write code to keep only the cars. For that, we need to know the numeric ID of the car class in the COCO dataset.

def gen_car_bboxes(im, predictor):
  outputs = predictor(im)
  # the numeric ID of the car class in COCO is 2
  a = outputs["instances"].pred_classes
  indices = (a == 2).nonzero().flatten()
  # keep only the instances classified as cars (helper sketched below)
  output_cars = custom_output(outputs, indices, im)
  return output_cars

As expected, the model now ignores trucks and only cars are detected.
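The `custom_output` helper lives in the repository rather than in this README. A minimal sketch, assuming it only filters the predicted instances down to the given indices (Detectron2's `Instances` supports tensor indexing), could be:

def custom_output(outputs, indices, im):
  # keep only the instances at `indices` (the cars); `im` is unused here
  # but kept to match the call sites above
  filtered = outputs["instances"][indices]
  return {"instances": filtered}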

Finally, we can use the work above to write a function that returns the predicted boxes as a PyTorch tensor.

def gen_bbox_predictions(im, predictor):
  outputs = predictor(im)
  a = outputs["instances"].pred_classes
  indices = (a == 2).nonzero().flatten()
  op = custom_output(outputs, indices, im)
  # extract the raw boxes and cast them to integer pixel coordinates
  preds = op['instances'].pred_boxes
  torch_preds = preds.tensor
  torchint_preds = torch_preds.type(torch.IntTensor)
  return torchint_preds

I then find which cars overlap which parking spot boxes, and to what extent. If the overlap is good enough, we predict that the car is parked.

And this brings us to the next bit: how do we measure the amount of overlap effectively?

Calculating overlap

As far as overlap calculation is concerned, IOU is the go-to solution in the deep learning world. So, what is IOU?

Intersection Over Union, abbreviated as IOU, is an evaluation metric that measures the extent of overlap between two rectangular boxes. The amazing Adrian Rosebrock explains it on his blog post about IOU.

The code below is taken from that same blog post.

def bb_intersection_over_union(boxA, boxB):
  # determine the (x, y)-coordinates of the intersection rectangle
  xA = max(boxA[0], boxB[0])
  yA = max(boxA[1], boxB[1])
  xB = min(boxA[2], boxB[2])
  yB = min(boxA[3], boxB[3])
  # compute the area of intersection rectangle
  interArea = max(0, xB - xA + 1) * max(0, yB - yA + 1)
  # compute the area of both the prediction and ground-truth
  # rectangles
  boxAArea = (boxA[2] - boxA[0] + 1) * (boxA[3] - boxA[1] + 1)
  boxBArea = (boxB[2] - boxB[0] + 1) * (boxB[3] - boxB[1] + 1)
  # compute the intersection over union by taking the intersection
  # area and dividing it by the sum of prediction + ground-truth
  # areas - the intersection area
  iou = interArea / float(boxAArea + boxBArea - interArea)
  # return the intersection over union value
  return iou
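A quick sanity check with two made-up boxes: [0, 0, 100, 100] and [50, 50, 150, 150] intersect in a 51×51 region (this implementation counts boundary pixels, hence the +1 terms), giving an IOU of 2601 / (10201 + 10201 - 2601) ≈ 0.15:

boxA = [0, 0, 100, 100]    # e.g. a parking spot
boxB = [50, 50, 150, 150]  # e.g. a detected car
print(bb_intersection_over_union(boxA, boxB))  # ~0.146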

The IOU function takes in two boxes (a parking spot and a model-predicted box) and returns their IOU. If a parking spot's IOU with any of the predicted boxes exceeds the threshold, we draw it on the image in green, otherwise in red.

Basically, for every parking spot we loop over every predicted box and store the IOUs in a list. We then take the maximum IOU, and if it is greater than the threshold, the spot is occupied. A nested loop sounds extremely inefficient, but we will worry about that later (see the vectorised sketch below).

def draw_output(torchint_preds, torch_bbox, img, iou_threshold=0.3):
  # loop over all the predefined parking spot boxes
  for label in torch_bbox:
    iou_list = []
    # loop over model predicted boxes
    for pred in torchint_preds:
      iou_list.append(bb_intersection_over_union(label, pred))
    # with no detections at all, the spot counts as empty
    iou_max = max(iou_list, default=0)
    if iou_max > iou_threshold:
      # if the spot is occupied, draw a green box
      color = (0, 255, 0)
    else:
      # if the spot is empty, draw a red box
      color = (0, 0, 255)
    pt1, pt2 = (int(label[0]), int(label[1])), (int(label[2]), int(label[3]))
    img = cv2.rectangle(img, pt1, pt2, color, 2)
  return img

`iou_threshold` is a hyperparameter to tune here.
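Todo item 2 above mentions vectorising the IOU computation. A minimal sketch using `torchvision.ops.box_iou`, which computes all spot/prediction pairs in one call (note it does not use the +1 boundary-pixel convention of the loop version, so values differ slightly), could be:

import torch
from torchvision.ops import box_iou

def spot_occupancy(torchint_preds, torch_bbox, iou_threshold=0.3):
  # no detections: every spot is vacant
  if torchint_preds.numel() == 0:
    return torch.zeros(len(torch_bbox), dtype=torch.bool)
  # pairwise IOU matrix of shape (num_spots, num_preds)
  ious = box_iou(torch_bbox.float(), torchint_preds.float())
  # a spot is occupied if its best-matching prediction clears the threshold
  return ious.max(dim=1).values > iou_threshold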

Visualising results

Now that the bits and pieces of code are ready, all that remains is to put them together, run the code, and visualize the final results:

img = cv2.imread(img_path)
predictor, cfg = model.setup_model()
torch_bbox = model.generate_label_bboxes(label_path)
torchint_preds = model.gen_bbox_predictions(img, predictor)
image_out = model.draw_output(torchint_preds, torch_bbox, img)
cv2.imshow('result', image_out)
cv2.waitKey(0)

(Figure: final output with occupied spots drawn in green and vacant spots in red)
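Since the stated goal also includes counting vacant and occupied spots, the counts fall out of the per-spot occupancy directly; a short sketch using the hypothetical `spot_occupancy` helper from the previous section:

occupied = spot_occupancy(torchint_preds, torch_bbox)
num_occupied = int(occupied.sum())
num_vacant = len(torch_bbox) - num_occupied
print(f'occupied: {num_occupied}, vacant: {num_vacant}')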
