Since graduating from the University of Manchester with a degree in Physics, I have worked as a developer on a range of machine learning projects using Python; out of personal interest I have also been learning SQL as well as front-end web development. I'm a naturally curious, adaptable person who has worked independently, within teams and, on occasion, has assisted with managing projects as well. I am also a quick learner and an excellent communicator, and have written reports and given presentations aimed at audiences of all backgrounds and levels, both technical and non-technical.
This was my Master's project, conducted at the University of Manchester in collaboration with The Christie NHS Foundation Trust. The first part involved setting up a project pipeline to read in MRI image files (in NIfTI format), pre-process them and store the processed images. These were then used for large-scale feature extraction using the PyRadiomics package.
The second part combined the above pipeline with a U-Net style convolutional neural network (CNN) to apply deep learning to segmentation of two defined cancerous regions: the gross tumour volume (GTV), which indicates the visible, macroscopic spread, and the clinical target volume (CTV), which includes the GTV plus a margin of surrounding tissue containing possible microscopic spread.
At the time, GTV segmentation had never been done before using deep learning for cervical cancer. The technique employed transfer learning using the ImageNet database (to compensate for the limited size of the cervical cancer image dataset) and, in the end, accuracies of 55-80% were obtained in identifying the correct GTV/CTV regions. The project was predominantly Python-based, though some MATLAB and Lua code had to be written as well. For future work, the technique could also be repeated on other sets of clinical images.
The top figure and video show the contours predicted for the GTV regions of a test MRI dataset. The Dice coefficients calculated indicate the level of overlap between predictions and actual delineations, with a value of 1 indicating maximum overlap. The middle figure shows the GTV (red) and CTV (yellow) delineations used for training. The bottom figure shows the overlap between the predicted (orange) and actual (yellow) delineations for the CTV region.
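The Dice coefficient used above can be computed directly from a pair of binary segmentation masks. A minimal sketch (the function name and example masks are illustrative, not from the project code):

```python
def dice_coefficient(pred, truth):
    """Dice similarity between two binary masks (iterables of 0/1 labels).

    Dice = 2 * |A intersect B| / (|A| + |B|); 1.0 means perfect overlap.
    """
    pred = list(pred)
    truth = list(truth)
    intersection = sum(p * t for p, t in zip(pred, truth))
    total = sum(pred) + sum(truth)
    if total == 0:
        return 1.0  # both masks empty: treat as perfect agreement
    return 2.0 * intersection / total

# Example: 3 overlapping voxels, 4 predicted and 4 actual positives
print(dice_coefficient([1, 1, 1, 1, 0], [0, 1, 1, 1, 1]))  # → 0.75
```

In practice the masks would be flattened 3D voxel arrays, but the formula is identical.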
This project involved reviewing techniques for generating synthetic data, driven by the need to share sensitive data for R&D purposes. Synthetic data generation tools aim to mimic the characteristics of the real dataset whilst removing or obscuring the sensitive information it contains. The final report was made public here on Gov.UK.
For example, we might want the synthetic data to retain the range of values of the original data with similar (but not the same) outliers. Or we might want to retain a similar frequency distribution in the synthetic and original datasets. However, this becomes more complex when we start to consider interactions between fields, or different types of data such as free text and GPS locations.
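Requirements like these can be turned into simple automated checks. A minimal stdlib-only sketch (column data and names are made up for illustration) that compares the value range and coarse frequency distribution of a synthetic column against the original:

```python
def histogram(values, bins, lo, hi):
    """Coarse frequency distribution: counts per equal-width bin over [lo, hi]."""
    width = (hi - lo) / bins
    counts = [0] * bins
    for v in values:
        # Clamp so out-of-range values land in the edge bins
        i = max(0, min(int((v - lo) / width), bins - 1))
        counts[i] += 1
    return counts

def compare_columns(original, synthetic, bins=4):
    """Check range retention and compare binned frequencies of two columns."""
    lo, hi = min(original), max(original)
    return {
        "range_retained": all(lo <= v <= hi for v in synthetic),
        "original_hist": histogram(original, bins, lo, hi),
        "synthetic_hist": histogram(synthetic, bins, lo, hi),
    }

report = compare_columns([0, 1, 2, 3, 4, 5, 6, 7], [0.5, 3.5, 6.5])
```

Comparing the two histograms side by side shows whether the synthetic column preserves the original's frequency distribution without copying its exact values.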
Working in a team, I helped explore public datasets similar to those used by the customer and identified tools that could be used to produce synthetic data. For example, the Czech financial database, comprising numerical and categorical data, was used with the Synthetic Data Vault (SDV) Python package, whilst text-based resumes/CVs were used with Microsoft's open-source tool "Presidio" (which is also powered by AI algorithms).
SDV worked with relational tables, and from testing and analysis using cross-correlation heat maps and histograms, the tool produced mixed results, since it only seemed to work well with Gaussian-distributed data. This often led to odd outputs such as negative bank ID values. However, since the tool is open-source and still in active development, the ability to specify other distributions may be added at a later date.
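In the spirit of the cross-correlation comparison described above, one can compute pairwise Pearson correlations for the real and synthetic tables and inspect how much they differ. A minimal pure-Python sketch with made-up column data (all names are illustrative):

```python
import math

def pearson(xs, ys):
    """Pearson correlation coefficient between two equal-length sequences."""
    n = len(xs)
    mx, my = sum(xs) / n, sum(ys) / n
    cov = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    sx = math.sqrt(sum((x - mx) ** 2 for x in xs))
    sy = math.sqrt(sum((y - my) ** 2 for y in ys))
    return cov / (sx * sy)

def correlation_matrix(table):
    """All pairwise correlations over a dict of column-name -> list of values."""
    cols = sorted(table)
    return {(a, b): pearson(table[a], table[b]) for a in cols for b in cols}

# Illustrative data: the synthetic columns roughly track the real ones
real = {"balance": [100, 200, 300, 400], "loan": [10, 19, 32, 41]}
synth = {"balance": [120, 180, 310, 390], "loan": [12, 22, 28, 44]}

r, s = correlation_matrix(real), correlation_matrix(synth)
# Largest absolute difference summarises how well the pairwise
# correlation structure is preserved (0 = identical structure)
max_diff = max(abs(r[k] - s[k]) for k in r)
```

Plotting each matrix as a heat map, as done in the project, makes the same comparison visual rather than numeric.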
The final report can be viewed on Gov.UK here, whilst a blog article about this investigation can be found here.
The figure shows a summary of the approach, indicating the components and data flow (coloured arrows), as well as showing how the synthetically-generated data is evaluated.
This figure shows the evaluation framework for synthetic data generators, consisting of six indicators.
A comparison of original, sensitive data with synthetically-generated data.
For details of other projects, please contact me using the email below.
An interactive WordPress website built with a custom theme for a fictional university.
My GitHub portfolio of web development and data science projects, built with the Bootstrap framework.
A fast food restaurant website which also has table reservations.
A restaurant website which allows for table reservations and online delivery.
Click one of the buttons below if you wish to contact me, or send an email to brijp019@gmail.com.