Data Structures in Python

Data structures and algorithm implementations in Python.

Objective

Practice implementing various data structures to gain experience designing data structures and algorithms in Python.

List ADT

Binary Search on Sorted List

Implementation for a simple binary search algorithm on a sorted list.
Code: Binary Search

Quicksort Unsorted List

Implementation for a simple Quicksort algorithm on an unsorted list.
Code: Quicksort

Linked List

Implementation for a simple LinkedList data structure.
Code: LinkedList

Stack ADT

Implementation for a simple stack data structure.
Code: Stack

Check Brackets In Text

Implement a feature for a text editor to find errors in the usage of brackets in the code.

Input: Input contains one string 𝑆 which consists of big and small latin letters, digits, punctuation marks and brackets from the set []{}().
Constraints: The length of 𝑆 is at least 1 and at most 10^5.
Output: If the code in 𝑆 uses brackets correctly, outputs “Success". Otherwise, outputs the 1-based index of the first unmatched closing bracket, and if there are no unmatched closing brackets, outputs the 1-based index of the first unmatched opening bracket.
Code: Check Brackets

Stack With Maximum

Stack is an abstract data type supporting the operations Push() and Pop(). Implement both these operations to work in constant time. Implement a stack that also supports finding the maximum value and to ensure that all operations still work in constant time.

Input: The first line of the input contains the number 𝑞 of queries. Each of the following 𝑞 lines specifies a query of one of the following formats: push v, pop, or max.
Constraints: 1 ≤ 𝑞 ≤ 40,0000; 0 ≤ 𝑣 ≤ 10^5.
Output: For each max query, output (on a separate line) the maximum value of the stack.
Code: Max Stack

Queue ADT

Implementation for a simple queue data structure.
Code: Queue

Maximum In Sliding Window

Given a sequence 𝑎1,...,𝑎𝑛 of integers and an integer 𝑚 ≤ 𝑛, find the maximum among {𝑎𝑖,...,𝑎𝑖+𝑚−1} for every 1 ≤ 𝑖 ≤ 𝑛 − 𝑚 + 1. A naive 𝑂(𝑛𝑚) algorithm for solving this problem scans each window separately. Design an 𝑂(𝑛) algorithm.

Input: The first line contains an integer 𝑛, the second line contains 𝑛 integers 𝑎1,...,𝑎𝑛 separated by spaces, the third line contains an integer 𝑚.
Constraints: 1 ≤ 𝑛 ≤ 10^5, 1 ≤ 𝑚 ≤ 𝑛, 0 ≤ 𝑎𝑖 ≤ 10^5 for all 1 ≤ 𝑖 ≤ 𝑛.
Output: Output max{𝑎𝑖,...,𝑎𝑖 + 𝑚 − 1} for every 1 ≤ 𝑖 ≤ 𝑛 − 𝑚 + 1.
Code: Max Sliding Window

Convert Array Into Heap

The first step of the HeapSort algorithm is to create a heap from the array you want to sort. Implement the first step and convert a given array of integers into a min-heap. Convert the array into a heap using only 𝑂(𝑛) swaps.

Input: First line contains a single integer 𝑛. Next line contains 𝑛 space-separated integers 𝑎𝑖.
Constraints: 1 ≤ 𝑛 ≤ 100,000; 0 ≤ i,j ≤ 𝑛-1; 0 ≤ 𝑎0,...,𝑎𝑛-1 ≤ 10^9.
Output: First line should contain a single integer 𝑚 (the total number of swaps) and 𝑚 must satisfy conditions 0 ≤ 𝑚 ≤ 4𝑛. The next 𝑚 lines should contain the swap operations used to convert the array 𝑎 into a heap.
Code: Build Heap

Parallel Processing

You have a program which is parallelized and uses 𝑛 independent threads to process the given list of 𝑚 jobs. Threads take jobs in the order they are given in the input. If there is a free thread, it immediately takes the next job from the list. If a thread has started processing a job, it doesn't interrupt or stop until it finishes processing the job. If several threads try to take jobs from the list simultaneously, the thread with smaller index takes the job. For each job you know exactly how long will it take any thread to process this job, and this time is the same for all the threads. You need to determine for each job which thread will process it and when will it start processing.

Input: The first line of the input contains integers 𝑛 and 𝑚. The second line contains 𝑚 integers 𝑡𝑖 — the times in seconds it takes any thread to process 𝑖-th job. The times are given in the same order as they are in the list from which threads take jobs. Threads are indexed starting from 0.
Constraints: 1 ≤ 𝑛 ≤ 10; 1 ≤ 𝑚 ≤ 10; 0 ≤ 𝑡𝑖 ≤ 10.
Output: Output exactly 𝑚 lines. 𝑖-th line (0-based index is used) should contain two space separated integers, the 0-based index of the thread which will process the 𝑖-th job and the time in seconds when it will start processing that job.
Code: Job Queue

Hashing

Phone Book

Implement a simple phone book manager that should be able to process the following types of user's queries:

add number name. The user adds a person with name name and phone number number to the phone book. If there exists a user with such number already, then the manager overwrites the corresponding name.
del number. The manager erases a person with number number from the phone book. If there is no such person, then it ignores the query.
find number. The user looks for a person with phone number number. The manager replies with the appropriate name, or with string “not found" if there is no such person in the book.

Input: There is a single integer 𝑁 in the first line (the number of queries). It’s followed by 𝑁 lines, each of them contains one query in the format described above.
Constraints: 1 ≤ 𝑁 ≤ 10^5. All phone numbers consist of decimal digits, they don’t have leading zeros, and each of them has no more than 7 digits. All names are non-empty strings of latin letters, and each of them has length at most 15. It’s guaranteed that there is no person with name “not found".
Output: Print the result of each find query — the name corresponding to the phone number or “not found" (without quotes) if there is no person in the phone book with such phone number. Output one result per line in the same order as the find queries are given in the input.
Code: Phone Book

Hashing With Chains

Implement a hash table with lists chaining given the number of buckets 𝑚 and the hash function. It is a polynomial hash function h(𝑆) = (∑^|𝑆|−1_𝑖=0 𝑆[𝑖]𝑥𝑖 mod 𝑝) mod 𝑚, where 𝑆[𝑖] is the ASCII code of the 𝑖-th symbol of 𝑆, 𝑝 = 1,000,000,007 and 𝑥 = 263. The program should support the following kinds of queries:

add string: insert string into the table. If there is already such string in the hash table, then just ignore the query.
del string: remove string from the table. If there is no such string in the hash table, then just ignore the query.
find string: output “yes" or “no" depending on whether the table contains string or not.
check i: output the content of the 𝑖-th list in the table. Use spaces to separate the elements of the list. If 𝑖-th list is empty, output a blank line.

When inserting a new string into a hash chain, you must insert it in the beginning of the chain.

Input: There is a single integer 𝑚 in the first line (the number of buckets you should have). The next line contains the number of queries 𝑁. It’s followed by 𝑁 lines, each of them contains one query in the format described above.
Constraints: 1 ≤ 𝑁 ≤ 10^5; 𝑁/5 ≤ 𝑚 ≤ 𝑁. All the strings consist of latin letters. Each of them is non-empty and has length at most 15.
Output: Print the result of each of the find and check queries, one result per line, in the same order as these queries are given in the input.
Code: Hash Chains

Find Substring Pattern

Implement the Rabin–Karp’s algorithm for searching the given pattern in the given text.

Input: There are two strings in the input: the pattern 𝑃 and the text 𝑇.
Constraints: 1 ≤ |𝑃| ≤ |𝑇| ≤ 5 · 10^5. The total length of all occurrences of 𝑃 in 𝑇 doesn't exceed 10^8. The pattern and the text contain only latin letters.
Output: Print all the positions of the occurrences of 𝑃 in 𝑇 in the ascending order. Use 0-based indexing of positions in the text 𝑇.
Code: Hash Substring

Substring Equality

Use hashing to design an algorithm that is able to preprocess a given string 𝑠 to answer any query of the form “are these two substrings of 𝑠 equal?” efficiently.

Input: The first line contains a string 𝑠 consisting of small Latin letters. The second line contains the number of queries 𝑞. Each of the next 𝑞 lines specifies a query by three integers 𝑎, 𝑏, and 𝑙.
Constraints: 1 ≤ |𝑠| ≤ 500000. 1 ≤ 𝑞 ≤ 100000. 0 ≤ 𝑎, 𝑏 ≤ |𝑠| − 𝑙.
Output: For each query, output “Yes” if s𝑎s𝑎+1...s𝑎+𝑙−1 = s𝑏s𝑏+1...s𝑏+𝑙−1 are equal, and “No” otherwise.
Code: Substring Equality

Longest Common Substring

In the longest common substring problem one is given two strings 𝑠 and 𝑡 and the goal is to find a string 𝑤 of maximal length that is a substring of both 𝑠 and 𝑡. This is a natural measure of similarity between two strings. The problem has applications in text comparison and compression as well as in bioinformatics. The problem can be seen as a special case of the edit distance problem (where only insertions and deletions are allowed). Hence, it can be solved in time 𝑂(|𝑠| · |𝑡|) using dynamic programming. In this problem, your goal is to use hashing to solve it in almost linear time.

Input: Every line of the input contains two strings 𝑠 and 𝑡 consisting of lower case Latin letters.
Constraints: The total length of all 𝑠’s as well as the total length of all 𝑡’s does not exceed 100000.
Output: For each pair of strings 𝑠 and 𝑡𝑖, find its longest common substring and specify it by outputting three integers: its starting position in 𝑠, its starting position in 𝑡 (both 0-based), and its length. More formally, output integers 0 ≤ 𝑖 < |𝑠|, 0 ≤ 𝑗 < |𝑡|, and 𝑙 ≥ 0 such that 𝑠𝑖𝑠𝑖+1···𝑠𝑖+𝑙−1 = 𝑡𝑗𝑡𝑗+1···𝑡𝑗+𝑙−1 and 𝑙 is maximal.
Code: Common Substring

Trees

Binary Search Trees

Code: Binary Tree Traversals
Code: Is BST
Code: Is BST Hard

Technologies

Python 3.8
PyCharm (Professional Edition) - Version: 2020.2.5
Visual Studio Code - Version: 1.51.1

License

This project is licensed under the terms of the MIT license. See the LICENSE file for details.

Name		Name	Last commit message	Last commit date
Latest commit History 46 Commits
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
binarysearch.py		binarysearch.py
binarysearchtree.py		binarysearchtree.py
build_heap.py		build_heap.py
check_brackets.py		check_brackets.py
common_substring.py		common_substring.py
graph.py		graph.py
graphtraversal.py		graphtraversal.py
hash_chains.py		hash_chains.py
hash_substring.py		hash_substring.py
hashtable.py		hashtable.py
is_bst.py		is_bst.py
is_bst_hard.py		is_bst_hard.py
job_queue.py		job_queue.py
linkedlist.py		linkedlist.py
max_sliding_window.py		max_sliding_window.py
max_stack.py		max_stack.py
merging_tables.py		merging_tables.py
phone_book.py		phone_book.py
queue.py		queue.py
quicksort.py		quicksort.py
stack.py		stack.py
substring_equality.py		substring_equality.py
tree-orders.py		tree-orders.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Data Structures in Python

Table of Contents

Objective

List ADT

Binary Search on Sorted List

Quicksort Unsorted List

Linked List

Stack ADT

Check Brackets In Text

Stack With Maximum

Queue ADT

Maximum In Sliding Window

Convert Array Into Heap

Parallel Processing

Hashing

Phone Book

Hashing With Chains

Find Substring Pattern

Substring Equality

Longest Common Substring

Trees

Binary Search Trees

Technologies

License

About

Languages

License

jerrybelmonte/DataStructures-Python

Folders and files

Latest commit

History

Repository files navigation

Data Structures in Python

Table of Contents

Objective

List ADT

Binary Search on Sorted List

Quicksort Unsorted List

Linked List

Stack ADT

Check Brackets In Text

Stack With Maximum

Queue ADT

Maximum In Sliding Window

Convert Array Into Heap

Parallel Processing

Hashing

Phone Book

Hashing With Chains

Find Substring Pattern

Substring Equality

Longest Common Substring

Trees

Binary Search Trees

Technologies

License

About

Resources

License

Stars

Watchers

Forks

Languages