Binary Tree Puzzles and Interview Questions

Many of the "programming puzzle of the day", or "dev job interview questions" relate to binary trees. This repo contains a binary tree implementation in a Go package, as well as code that solves puzzle-or-problem-of-the-day questions.

I do have other binary tree repos that illustrate problems too big to fit in this repo:

Some of the problems and puzzles below should be their own repos because of their size, but the convenience of a single binary tree package is too great to break them out.

Building

Support code resides entirely in package tree, which lives in the tree/ directory. Answering the questions or solving the puzzles almost always means doing something different than the generic support code does, so the code or algorithms to solve problems lives in the top level directory.

Build and test goes something like this:

$ make all
$ ./runtests
$ cd tree
$ go test -v .

Some of the programs aren't included in the make all. You'd build them like this:

$ go build secondlargest.go
$ ./secondlargest '(3 (1 0 2) (5 4 6))'
(3 (1 0 2) (5 4 6))
Largest value 6
Second largest value 5

A lot of the programs output GraphViz dot-format descriptions of the answers. That way you can visually check that the code does what it's supposed to do.

Questions and programs

I have a bigger binary tree question, recreate binary tree from its post-order traversal, in its own repo Still can't believe they marked that one "medium".

Create a randomly valued tree.
Create a GraphViz drawing of a tree. This code creates a binary search tree (BST) by inserting values as they appear on the command line. I believe you can create a BST of any shape by inserting the values of nodes of a BST with the desired shape in breadth-first order.

Daily Coding Problem: Problem #842 [Medium]

This problem was asked by Google.

Invert a binary tree.

For example, given the following tree:

    a
   / \
  b   c
 / \  /
d   e f

should become:

  a
 / \
 c  b
 \  / \
  f e  d

Code #1

Code #2

Code #3

For attempt #1, I had it create a binary search tree of random-valued nodes so that the inversion is obvious. I wrote this one before generalizing tree.NumericNode and tree.StringNode into interface tree.Node. At that time, the data and left, right child pointers weren't exported, nor did access functions exist. I made func (p *NumericNode) Invert() a method of tree node struct and put it in the support code.

Attempt #2 I wrote well after generalizing numeric and string node types, but I chose to use tree.StringNode so that I could get exactly the output tree that the problem statement specifies. I also used func invert(node *tree.StringNode) instead of a method on the node's type. I'm torn about object oriented programming, and "inverting a tree" seems less like a operation performed on a node, which would seem to need a method, and more like operating on the entire data structure.

Attempt #3 uses tree.Node Go interface types to see if that's possible, and it is.

In all attempts, I found that checking for a nil node pointer is best done in function or method invert, to avoid complicating the inversion function with 2 tests for nil children. This has become a recurring motif.

This has been at least 3 different "Daily Coding Problems", two rated "[Medium]"

Found a very good tutorial walking through in detail how to do this problem in C.

Daily Coding Problem: Problem #752 [Easy]

Also, Problem #107.

This problem was asked by Microsoft.

Print the nodes in a binary tree level-wise. For example, the following should print 1, 2, 3, 4, 5.

Analysis

Breadth-first traverse iterative traverse of tree.

This is an old one: instead of using a stack (implicit function call stack, or an explicit data structure), the algorithm uses a FIFO queue to keep track of its place in the traverse of the tree.

$ go build breadthfirst.go
$ ./breadthirst '(1(2(4)(5))(3(6)(7)))'
1 2 3 4 5 6 7
$

Due to the simplicity of this problem, maybe interviewers should use it only on entry-level candidates.

Daily Coding Problem: Problem #622 [Easy]

This problem was asked by Google.

Given the root of a binary tree, return a deepest node. For example, in the following tree, return d.

    a
   / \
  b   c
 /
d

Analysis

First cut at finding depth of tree, carries a struct around.
Second cut at finding depth of tree, completely recursive, returns deepest node.
Third cut at finding depth of tree. Uses tree.Node interface, func tree.AllorderTraverseVisit. No globals involved.
Fourth cut at finding depth of tree, using Go channels. This is an example of Go-peculiar program structure, leveraging channels and goroutines.

The problem statement confuses data with data structure. The "deepest node" of the example isn't 'd', but 'd' is the value of the deepest node. Interviewers should be clear, in my opinion.

This is a question for an entry-level position interview. It involves a traverse of a binary tree where the value of the nodes is only for identification, so pre-, post- or in-order doesn't matter. The only things to do are recognize a leaf node, and keep track of depth in the tree. The interviewer could look for orderly design process, neat and tidy coding, entry-level things like that.

If the candidate were to suggest test cases, a single-node tree, left-heavy like the example, a complementary right-deep test, and some case in the middle might be in order.

I did the third version as an exercise: see if visitor functions and interface type tree.Node could do the task.

I did the fourth version to show a Go-peculiar design pattern. Have a goroutine generate some plausible values, writing them to a channel. The main goroutine reads from the channel and filters the plausible values into desired value(s). Recursive functions are simplified, since the code for filtering the plausible values lives in the main goroutine, and nothing has to be returned: possible answers get written to a channel.

Reconstruct a tree from traversals

This is Daily Coding Problem: Problem #435 [Medium]

Given pre-order and in-order traversals of a binary tree, write a function to reconstruct the tree.

For example, given the following preorder traversal:

[a, b, d, e, c, f, g]

And the following inorder traversal:

[d, b, e, a, f, c, g]

You should return the following tree:

    a
   / \
  b   c
 / \ / \
d  e f  g

Analysis

A clever solution exists. Isn't this O(n²), though? It's also a leetcode problem.

The in-order traversal gives you an ordering of the elements. You can reconstruct the original binary tree by adding elements to a binary search tree in the pre-order traversal order, with "<=>" determined by the in-order traversal, instead of using <, >, == built-in operators on data values to make comparisons. My code constructs a map[string]int where the keys are strings from the in-order traverse and the values are the indices of those strings when they're in-order in an array. func insert in this code can decide which child to recurse down by getting numeric values from the map and comparing those. func insert looks a lot like an ordinary recursive function that inserts values to create a binary search tree.

At the time I encountered this problem, it suggested that my tree package func Insert could be generalized. Originally, type tree.Node carried an int value. I changed the int-valued struct to tree.NumericNode, and made tree.Node into a Golang interface. Later I added tree.StringNode, for problems that require a binary tree of strings.

I created a generalized binary tree node, type tree.Node, a Go interface type. Types tree.NumericNode and tree.StringNode, both structs, have accompanying methods that make them satisfy the tree.Node interface. Function tree.GeneralCreateFromString can and is used to create binary trees with integer value nodes (func CreateNumericFromString) and string value nodes (func CreateFromString).

Interview Analysis

An interviewer asking this question would have to decide what they wanted from the candidate. If a candidate had that flash of insight that let them create the clever algorithm, is that candidate suitable for an "enterprise" programming role where boring, grind-it-out, lots of boilerplate and standard following is necessary?

Return all paths from the root to leaves

Another daily coding puzzle, "Daily Coding Problem: Problem #587 [Medium]".

Given a binary tree, return all paths from the root to leaves.

For example, given the tree

it should return [[1, 2], [1, 3, 4], [1, 3, 5]].

Analysis

The phrasing of the answer seems to assume the use of Python. My program creates a binary search tree from number representations on the command line, then traverses the newly-created tree. It adds each node's value to the current path when that node gets visited by the traverse. At leaf nodes, it copies the path, and keeps that copy. As the traverse leaves a node, the code trims the node's value from the current path. I had to write a recursive visitor function that includes pre-order and post-order function calls, and write a type that could be used to accumulate paths at leaf nodes, and also kept the current path updated.

I'd say this is actually about a medium difficulty interview problem, suitable for whiteboarding with an above-junior-level programmer. The candidate would have to understand recursive functions, but use those recursive functions as a scaffold for the work to find the answer.

Daily Coding Problem: Problem #540 [Easy]

Also: Daily Coding Problem: Problem #810 [Easy]

This problem was asked by Morgan Stanley.

In Ancient Greece, it was common to write text with the first line going left to right, the second line going right to left, and continuing to go back and forth. This style was called "boustrophedon".

Given a binary tree, write an algorithm to print the nodes in boustrophedon order.

For example, given the following tree:

       1
     /   \
    /     \
  2         3
 / \       / \
4   5     6   7

You should return [1, 3, 2, 4, 5, 6, 7].

Analysis

I did this with two stacks, one to hold nodes for a rightward breadth-first pass, the other for a leftward breadth-first pass. I switched stacks from which to pop parent nodes when the current parent-node-stack turns up empty. You could probably perform the task with a single double-ended queue and a special marker node that indicates when to change direction.

There is no way this is an "easy" whiteboarding problem. The child nodes in a rightward pass are the parent nodes in the succeeding leftward pass: you have to think an entire "layer" of the tree ahead to decide what operation (pop or dequeue) to do in the next pass, and you have to think an entire layer behind to decide in what order to push or enqueue the child nodes. It's also very easy to make mistakes in the code, because pushing child nodes on the stacks happens in different orders depending on the direction the code is traversing the parent nodes' layer.

You'll want to try 1, 2, 3, and 4 deep input trees, both complete, and with a partial deepest layer.

$ ./bous 7 3 11 1 5 9 13 0 2 4 6 8 10 12 14 
7 11 3 1 5 9 13 14 12 10 8 6 4 2 0

should show you a boustrophedon traverse of a complete binary search tree of depth 4.

Cousin Nodes #1

Daily Coding Problem: Problem #487 [Medium]

This problem was asked by Yext. Whoever Yext is.

Two nodes in a binary tree can be called cousins if they are on the same level of the tree but have different parents. For example, in the following diagram 4 and 6 are cousins.

Given a binary tree and a particular node, find all cousins of that node.

Analysis

This solution doesn't meet the problem statement. See Cousin Nodes #2, below. I was misled by my intuition of what "cousins" means genealogically in the USA and by the example given.

My solution.

At first, I thought this was a fairly bogus question, but this might actually be a good interview problem. It does contain the opportunity to do a recursive or iterative search of a binary tree, allowing the candidate to demonstrate algorithm knowledge. It has a task that requires synthesizing a solution from several parts (finding a node, finding a parent node, finding uncle, finding cousin(s)), allowing the candidate to demonstrate thinking through a nonsensical problem, and let's face it, business logic is often an ambiguity wrapped up in a special case, inside some regulatory capture. It has the opportunity to point out test cases (can't find node, can't find grandparent, can't find uncle, 0 through 2 cousins), and it allows the candidate to demonstrate some insight (only 1 node can be parent of cousins, it's a binary tree).

Cousin Nodes #2

This problem was asked by Yext. Whoever Yext is.

Two nodes in a binary tree can be called cousins if they are on the same level of the tree but have different parents. For example, in the following diagram 4 and 6 are cousins.

Given a binary tree and a particular node, find all cousins of that node.

Analysis

My solution.

I see that I did Cousin Nodes #1 incorrectly. I misread the problem statement: cousin nodes aren't nodes that share a grandparent node, they're not biological cousins. Cousin nodes are just in the same depth in the tree.

The solution has 3 parts:

Construct the tree
Find depth of the "particular node".
Traverse tree, printing out or otherwise collecting all nodes at the same depth as the "particular node"

This is actually easier than the problem that I made up when I misread the problem statement. Finding the depth of the "particular" node, and finding all nodes at the same depth as that node are completely recursive, and can be written compactly. With a little care about recursion termination, you can be pretty confident you've got the right answers.

I'm less certain this is a good interview question. The heart of the difficulty is calling the nodes you want to find "cousin" nodes. "Cousin" has a well-defined familial or genealogical meaning, and it's not what this problem wants. The explanatory diagram is misleading, in that it implies that a biologically-inspired reading is correct. It seems to me that this problem statement is the difficulty, and that the interviewer might find out programming skill, but will mainly be sarcastically amused when most of the candidates don't get the correct answer.

Prune a tree so that subtrees containing all 0s are removed

Given a binary tree where all nodes are either 0 or 1, prune the tree so that subtrees containing all 0s are removed.

For example, given the following tree:

should be pruned to:

We do not remove the tree at the root or its left child because it still has a 1 as a descendant.

Analysis

My code does this with a post-order traverse of the entire original tree, pruning the tree as it goes.

By pruning left and right children, then checking to see if both child nodes are null and the current node's data is 0, the code can return the node or nil.

A leaf node with 0 value gets pruned by setting its parent's pointer to it to nil. If both child pointers of the parent have a nil value, or get set to a nil value in the traverse, the parent can be pruned by returning nil.

Perceiving a post-order traverse as a way to prune requires the ability to not shy away from doing something via brute force, and also a willingness to put off the work of pruning until both of a node's subtrees have been pruned.

I suspect that a lot of candidates would try to traverse the tree and write a function prunable(node *Node) bool, calling pruneable on each child node. This would get you wrapped around the axle of recursion, because prunable() would have to recurse each node's two child sub-trees, and then you'd want to write code to recurse into each of the sub-trees.

This is a pretty good interview question, if you want to see if the candidate has that flash of intuition, which may not arrive during the pressure of a whiteboard question.

Minimum height binary search tree

I seem to have mislaid all other information about this problem other than this simple statement:

Given a sorted array increasing order of unique integers, create a binary search tree with minimal height.

Analysis

Ordinarily, this would take quite a bit of work to get correct, but the "sorted array" clause makes it possible.

My code take advantage of the sort by using the middle element of the array as the root of the root. The sub-arrays on either side of the middle element will be binary search trees of the left and right children of the root of the tree, so the code can recurse. The base case actually comprises 3 special cases: 1, 2 and 3 element sorted arrays each get treated on their own. it was too hard to deal with a length 2 array with the general case code, and 3 element was too easy to not treat specially. Also, there's a choice of two arrangements for the length 2 case:

1
 \
  2

or

  2
 /
1

I have my code choosing one of them pseudo-randomly, just for equity. That is, it can create different minimal height trees from different runs with the same input.

I also wrote the straightforward version of this. It's a lot clearer, but it always creates the same tree every time from any given output.

Interview Analysis

This seems like a harder interview question. The interviewer should prepare to prompt the candidate. It requires some insight to notice that "sorted array" and "binary search tree" have a relationship via the middle-node-becomes-root, and that the relationship is recursive. Also, the choice of middle node for an even-length sorted array makes the actual coding a little harder. The candidate probably writes code to always chose one or the other. Calculating the middle index of a 0-indexed array causes humans troubles. It's easy to be off-by-one, and it's not the more familiar fence post problem. The interviewer shouldn't expect anything close to a good solution.

The candidate could amaze the interviewer by providing insightful test cases. Not only is a 1-node tree a good test case, but so are 2-node trees. Test cases that cause the algorithm to decide between two "middle indexes" of the array would be good. Something like an input of [0, 1, 2, 3] has 4 different trees of height 3. That's worth a test. "Slightly incomplete" trees, missing only 1, 2, 3 or so nodes from having a complete set of leaf nodes would be good test cases.

The candidate could also amaze the interviewer by proposing a test for whether or not the tree is of minimal height. A complete binary tree of depth D has N = 2^D-1 nodes. We know how many nodes are in the tree, we got an array as input. Find the depth of the tree, see if it's less than or equal to log₂(N+1)+1. This program does that.

The above image shows that the actual, discrete, tree heights follow a step function. Height of a tree is constant while filling in the bottom row of leaves. Complete binary trees have a height matching log₂(N+1). Binary trees with one leaf in the bottom row of leaves have a height matching log₂(N+1)+1. All other trees have a height between the two values.

Daily Coding Problem: Problem #545 [Hard]

Given a binary tree, find the lowest common ancestor (LCA) of two given nodes in the tree. Assume that each node in the tree also has a pointer to its parent.

According to the definition of LCA on Wikipedia: The lowest common ancestor is defined between two nodes v and w as the lowest node in T that has both v and w as descendants (where we allow a node to be a descendant of itself).

Analysis

My code.

It says "binary tree", not "binary search tree", so you can't assume an ordering.

The "assume that each node in the tree also has a pointer to its parent" is almost certainly a clue that they don't want the obvious solution, which is to find paths to V- and W-valued nodes, then compare paths to find the last common ancestor. The node that appears in both paths last is deepest in the tree.

Apparently this 18 page paper describes a sub-linear algorithm for finding the LCA, using back-links to parents. It's entirely unobvious, and I refuse to bother with it.

This is a weird problem to ask in an interview. Unless the interviewer wants a candidate who's read, understood, and memorized all of Robert Tarjan's many, many algorithms, nobody will pass this. Everyone will do the O(n) time algorithm, or waste all their time trying to recreate something inobvious.

Count nodes in less than linear time

This also appears as Daily Coding Problem: Problem #736 [Easy]

Given a complete binary tree, count the number of nodes in faster than O(n) time. Recall that a complete binary tree has every level filled except the last, and the nodes in the last level are filled starting from the left. "Complete" means: every level, except possibly the last, is completely filled, and all nodes in the last level are as far left as possible. It can have between 1 and 2h nodes at the last level h.

Incorrect Algorithm Analysis 1

Incorrect code

I know "complete binary tree" as "tree having the heap property".

Solutions easily found on the web don't do the actual work, the authors merely describe how to do it. It's harder to do it than to wave your hands about it.

Here's one way to do it

Suppose you number the child pointers, 0 for left child pointers, 1 for right child pointers. Give the leaf nodes labels based on which pointers the path to the leaf node takes. The leftmost leaf node is 000 in this diagram, the rightmost is 111. If you interpret the labels as binary numbers, the leaf nodes are labeled sequentially. Taking the bits of the binary numbers as instructions on which child pointer to choose, a node with a given label can be found in O(h) time.

If the leftmost depth is 1 greater than the rightmost depth, the tree isn't filled-in. You can do a binary search on the labels of the leaf nodes, because the digits of the labels are also the direction of child node pointer on which to recurse to get to the leaf node. You know that the leftmost node is labeled "00...00" and the rightmost node is labeled "11...1" (in this case it is 1 digit in length less than the leftmost node's label). The next tree depth to probe is the node labeled (11..1 + 00..00)/2, when interpreting node labels as numbers. The division-by-2 can be accomplished by shifting the digits to the right, minding the carry bit left from adding. If you don't correctly handle the carry bit, shifting the result of adding labels one bit to the right will give you gibberish when viewed as the left/right pointer choices.

Another subtlety exists when choosing when you've found the target of the search: I believe you want to compare the added-and-shifted mid-point label to the left side of the binary search bracketing labels. When the added-and-shifted mid-point equals the left-bracketing-label, you've found the deepest rightmost node.

That gets you a O(log₂m) search for the deepest rightmost node, where m is the number of nodes in the bottom-most rank of the tree.

My code does just such a binary search. It also counts the number of node accesses it does to find the rightmost node in the final level of the tree.

A log₂(N) curve fits the experimental count of node accesses very well. The constant factor here is about 9 - I just eyeballed that, I didn't curve fit.

If I drawn a line, y = x on the graph, I observe that for n < 45, my algorithm touches more than n nodes: There's a large constant on log₂m in the code.

The node-labeling algorithm is a O(log₂m) search for the deepest rightmost node, where m is the number of nodes in the bottom-most rank of the tree, if the tree was full at the left-most depth.

Each of the probes for the rightmost deepest node involves touching about as many nodes as the left depth of the tree. The O() in the binary search algorithm is usually the number of slots in a sorted array or list, not the number of nodes accessed.

I forgot to consider that the count of node accesses during binary search for the rightmost bottom-row node touches a lot of nodes more than once. For example, the binary search accesses the two children of the root node many times. If I count each access, my method fails a strict O(n) criteria.

I was blinded by the O(log₂h) complexity of binary search. In the context of binary search the h is number of nodes in the bottom level of the tree, and the O() measures comparisons of node values, as is traditional for complexity analysis of search algorithms.

If you work it all out, I think this algorithm is O(log₂(n+1)), for a tree with n nodes.

I reckon that O(log₂(n+1)) is less than O(n), so there must be a large constant in my actual code.

Another Algorithm Analysis

If you search for this problem on the web, you'll find a clever alternate solution, which is usually poorly explained.

Define a function something like this:

func countNodes(node *tree.Node, leftDepth, rightDepth) (nodeCount, nodeTouches in)

The leftDepth is the depth of the tree just following the left children. The rightDepth is the depth of the tree following only the right children.

Left depth and right depth will either be numerically equal, or left depth will be one greater.

If leftDepth evaluates to -1, find the left depth, this can be recursive or iterative. You've just touched leftDepth nodes.

Do the same for the right depth if formal argument rightDepth is -1.

If left depth and right depth equate numerically, the tree has 2^D - 1 nodes.

If left depth and right depth don't equate:

Call the function recursively with the left child node as the formal argument node, pass leftDepth - 1 for the left depth argument, -1 for the right depth argument.
Call the function recursively with the right child node as the formal argument node, pass -1 for the left depth argument, rightDepth - 1` for the right depth argument.

This recurses until a subtree has equal right and left depths (which are pre-calculated), where the function can return a node count for the subtree without further touches. Because the function usually touches extra nodes to calculate one of left depth, or right depth, it minimizes the number of nodes it accesses

By having left child nodes calculate right depth, and right child nodes calculate left depth, the function does a binary search for the leftmost node in the bottom rank of the tree.

You must be careful how you calculate left and right depth for a given node. Find depths from left and right child nodes of the given node to avoid re-touching the given node.

If you write code for this algorithm and follow all the "conservation of node access" techniques, you get an algorithm that is strictly less than O(n), where n is the number of nodes in a complete (but maybe not "full") binary tree. I'm not sure exactly what the time complexity of this algorithm is, but it's got to be O(log n) or close: it often doesn't have to touch half the nodes in a sub tree.

Interview Analysis

Since I would have failed this interview, you can take the following with a grain of salt.

This isn't a bad interview question, if the interviewer is after a candidate's understanding of computer science. If the interviewer is satisfied with a more-or-less handwaving explanation, or giving this as a take-home problem, it's fine. But it's not "easy" after the hand-waving It contains a lot of subtleties that would cause wasted time in a whiteboard coding experience. The candidate wouldn't demonstrate anything worthwhile on a whiteboard, just that they can puzzle over corner cases, and understand how integer division works in their favorite language. Letting a candidate analyze the problem out loud, skipping over some of the details might be the best way to learn if your candidate has problem solving abilities.

The leaf-node-labels-as-pointer-following-directions trick is also fairly subtle. I discovered it by accident. It's one of those "use a value as a number and also as something else" tricks that can make a speedy algorithm, and give clarity to an analysis, but are usually hard to see without lengthy puzzling over the problem.

If the candidate happens to know about the trick of using a variable-length array to store heap-shaped binary trees

Daily Coding Problem: Problem #502 [Easy]

Given a binary tree, determine whether or not it is height-balanced. A height-balanced binary tree can be defined as one in which the heights of the two subtrees of any node never differ by more than one.

Analysis

My code.

This is essentially the tree depth (or tree height) problem framed differently. As such, it's prey to all of the tree depth problem's difficulties. The interview candidate might fall into the trap of trying to write a Balanced() function that's recursive on its own, rather than finding max depth of each subtree then ensuring that any depth difference is not too great. The interviewer might not get a feel for the candidate's coding ability at all.

Daily Coding Problem: Problem #133

Also, Daily Coding Problem: Problem #609 [Medium].

Given a node in a binary tree, return the next bigger element, also known as the inorder successor.

For example, the inorder successor of 22 is 30.

The Problem #609 version also says:

You can assume each node has a parent pointer.

The example tree does have the Binary Search Tree property, but the written statement doesn't say that input trees have that property. My solution does not assume the input tree has that property. Perhaps it should have.

Analysis

My code.

I thought this would be easier than it was. My initial idea was to just do an in-order traverse of the tree, and if either child node has the given value, return the value of the current node.

This is wrong, as it misses the case where any node with 2 children (10 or 30 in tree above) is the given value. The successor value is the value of the right child node. It also misses the case where the given value is 35 in the tree above, the maximum value in the tree. It has no inorder successor.

For the tree above, each value in the tree has these inorder successor values:

Given Value	Inorder Successor
5	10
10	22
30	35
22	30
35	-

There's a missing case in the example tree the candidate would want to check.

If the given value is 6, the inorder successor node has a value of 10.

I think there's another corner case, where the given value doesn't exist in the tree. I suppose you could parse the problem's language to declare that case doesn't exist.

I declare this problem statement to be cunningly misleading.

This isn't one of those "flash of insight necessary" problems. In that respect, it's a decent interview problem. The interviewer should tailor their expectations for solutions based on what the candidate claims their experience is. Less senior developers would be lucky to write code that partially works. More senior developers might get a good analysis of the problem, but still have trouble writing code that works on every case.

The candidate would do well to analyze test cases first for this problem. That would give the candidate enough information to have a fighting chance to get a correct algorithm.

I ended up with code that had a complicated recursive in-order traverse. I interpreted "binary tree" in the problem to mean "unordered", not a Binary Search Tree. If "binary tree" means "binary search tree", then my code isn't the greatest: it will take forever to search for a given value in a large tree.

The search is one function that gets called with a *tree.NumericNode, the given value, and an instance of a 3-valued type. The 3 values of this type represent:

Looking for the given value
Found the given value
Found the inorder successor

Note that the search function receives one of these 3 values as an argument, and it needs to consider what the values mean when returned from a recursive call.

If the search function receives "found the given value" as an argument, the current node is the inorder successor. It returns its own value and "found the inorder successor".

If the current node is not the inorder successor, the left sub-tree can contain the given value. If the left child has the given value, the current node is the inorder successor. The search function can return the current node's value, and "found the inorder successor".

If the current node has the given value, either the right child is the inorder successor, or the inorder successor is up the tree.

There's a subtlety if the programmer calls the search function with a nil (or NULL) current node. This is common practice because it avoids having two checks (left and right child) for nil/NULL pointers, and it declutters recursive traverse code. A nil/NULL current node pointer has to return the value of the 3-valued type argument it's called with, rather than "looking for the given value".

Of course, the quick-and-dirty method of creating an array from an in-order traverse of the tree, finding the given value in the array, then returning the value at the next index would also work. It might be worth the candidate's time to mention this. The drawbacks? O(n) extra space for the array, O(log N) extra time for a binary search of the array. The recursive method only uses O(D) (D depth of tree) extra space.

I'm not at all sure what difference the parent pointers of the Problem #609 version would make. The "missing case" of the inorder successor of 6 in the tree above basically makes the parent pointer useless. I guess you could find the first number, then follow parent pointers until you find the next node with a bigger value than the first number, but off hand, I can't say that would always work.

Daily Coding Problem: Problem #490 [Medium]

This problem was asked by Yelp.

The horizontal distance of a binary tree node describes how far left or right the node will be when the tree is printed out.

More rigorously, we can define it as follows:

The horizontal distance of the root is 0.
The horizontal distance of a left child is hd(parent) - 1.
The horizontal distance of a right child is hd(parent) + 1.

For example, for the following tree, hd(1) = -2, and hd(6) = 0.

           5
         /   \
       /       \
      3         7
    /  \      /   \
  1     4    6     9
 /                /
0                8

The bottom view of a tree, then, consists of the lowest node at each horizontal distance. If there are two nodes at the same depth and horizontal distance, either is acceptable.

For this tree, for example, the bottom view could be [0, 1, 3, 6, 8, 9].

Given the root to a binary tree, return its bottom view.

Analysis

This is a largish problem for a whiteboard, the candidate who gets this question should talk it out, put some very high level pseudocode on the whiteboard to show you understand it before doing any "coding".

I doubt there's an actual "best solution". My solution:

Build a tree with nodes that have depth and horizontal distance fields.
Traverse tree, filling in depth and distance fields.
Traverse tree again, filling in a Go map with the deepest node for any given horizontal distance.
Construct an array of the deepest nodes that appear in the map.
Sort the array based on horizontal distance.
Print the sorted array.

These steps could be condensed into a single pass:

While building the tree, fill in depth and distance fields. Get this working, it's harder than either building the tree or filling in depth and distance.
While building the tree and calculating depth and distance, insert nodes into the map.

I guess this would be an O(n) algorithm, because it visits every single node in the tree.

This could be a good interview question, if the interviewer adjusts expectations for the level of the job. Entry level programmers could be coaxed to discuss binary trees, and how they might calculate depth and distance recursively. Mid-level programmers might sketch an overall solution, and implement parts on the whiteboard. Senior programmers might end up implementing the whole thing, if allowed enough time.

Daily Coding Problem: Problem #490 [Medium]

This question was asked by Apple.

Given a binary tree, find a minimum path sum from root to a leaf.

For example, the minimum path in this tree is [10, 5, 1, -1], which has sum 15.

Analysis

My solution.

First off, the request is phrased poorly: do they want the sum, the path or both? The candidate would be wise to ask. Returning the path is more work than just the sum.

Since the problem seems to allow negative numbers as node values, there's no way to short-circuit a complete traverse of the tree. The recursion needs to carry around a minimum sum found so far, and the associated path because of this.

Depending on what language the candidate does this problem, solving it may entail some array management code to keep from re-appending pieces of the paths through the tree. That's really the only tricky piece to the recursion.

The candidate could distinguish themself by suggesting test case input trees - a single element tree would be a decent test case, as would extreme left- and right-hand-side minimum-sum-paths. Maybe a tree with a large negative value in the leaf node would be a good test case.

The interviewer should only expect a simple recursive solution. There's no opportunity for cleverness or short-circuiting a traverse of the entire tree.

This might actually count as a medium-difficulty problem. It requires no great insight to solve, but the candidate would have to incorporate a few extras in an otherwise simple recursive solution.

Daily Coding Problem: Problem #482 [Medium]

This problem was asked by Google.

Given a binary search tree and a range [a, b] (inclusive), return the sum of the elements of the binary search tree within the range.

For example, given the following tree:

and the range [4, 9], return 23 (5 + 4 + 6 + 8).

Analysis

The obvious solution is to do a traverse of the input tree, carrying around a pointer to an int, the sum so far. At each node, check if the value is in the range [a, b], and add it to the pointed-to-int. I did this in func visit1 of my solution code.

The problem says "Given a binary search tree". That means that a program can use that property to avoid visiting nodes that have values less than a and greater than b. I did this in func visit2 of my solution.

My thought is that any time a binary tree problem or puzzle says that the tree has the binary search property, it's worth considering how to make that problem faster using the BST property. Seeing a candidate recognize and use that property is what this question is all about, for an interviewer.

The candidate should consider test cases, like giving a range that doesn't match any node's value, or excludes all nodes, or includes only one node. The shape of the input tree could also matter, so consider a tree that's effectively a linked list: all of the right-hand-children are non-null, but none of the left-hand-children are.

I don't believe this qualifies as a "medium" hard question. Consider Daily Coding Problem #475, which follows. That's a "medium" too, and much more difficult.

Daily Coding Problem: Problem #475 [Medium]

This problem was asked by Google.

Implement locking in a binary tree. A binary tree node can be locked or unlocked only if all of its descendants or ancestors are not locked.

Design a binary tree node class with the following methods:

is_locked, which returns whether the node is locked
lock, which attempts to lock the node. If it cannot be locked, then it should return false. Otherwise, it should lock it and return true.
unlock, which unlocks the node. If it cannot be unlocked, then it should return false. Otherwise, it should unlock it and return true.

You may augment the node to add parent pointers or any other property you would like. You may assume the class is used in a single-threaded program, so there is no need for actual locks or mutexes. Each method should run in O(h), where h is the height of the tree.

Analysis

It's weird that "locking" takes places in a single-threaded program with no need for actual locks or mutexes. I suspect this is to avoid all the ugliness of what needs to get locked, and when.

The problem statement says:

A binary tree node can be locked or unlocked only if all of its descendants
or ancestors are not locked.

This translates to "locking a node locks the sub-tree below it", and "you can't lock a node inside a locked sub-tree".

The problem has a giveaway hint:

You may augment the node to add parent pointers or any other property you would like.

If you add parent pointers, finding out if any ancestors are locked is O(log₂(N+1)), where N is the number of nodes in the tree, i.e. the depth of the tree. The nodes can keep a count of locked descendants, eliminating the need for traversing sub-trees to find any locked nodes.. Any node, even unlocked nodes, that have more than zero locked descendants aren't eligible to lock. Unlocking a node involves chasing parent pointers to the root, decrementing the count of locked descendants.

My solution.

I created a locking binary tree type, and a program that lets you interactively create trees, lock and unlock nodes, and inspect trees and nodes.

./locktree
Locked binary tree explorer
> ?
locking node binary tree explorer
Operations:
print - print lisp-like string rep of tree
checkall - show lock status of all nodes
check N - show lock status of node with value N
lock N - lock node with value N
unlock N - unlock node with value N
find N - print info about node with value N
create (...) - parse lisp-like tree rep, use it thereafter
> create (0(1()(2))(3()()))
> lock 2
locked node with value 2 at 0xc0000983f0
> print
(0/U(1/U()(2/L))(3/U))
> lock 1
did not lock node with value 1 at 0xc0000983c0

The program didn't lock node with value 1 because node with value 2, in it's right sub-tree, was already locked.

The conditions on locking and unlocking are to keep a sub-tree of a given node locked, and not re-lock sub-trees of nodes in the locked sub-tree. It's good that the problem says not to use mutexes, and that the code is to be used in single-threaded programs, because actually locking a node would seem to involve locking the entire tree to chase parent pointers. I'm at a loss to explain what's going on with this problem, except that maybe it's nonsensical nature is to get candidates to solve a problem they've never seen before.

From that standpoint, the interviewer has to watch for 2 things:

That the candidate has the insight that a parent point can allow O(h) lock and unlocks by chasing those parent pointers.
Programming mechanics. Chasing pointers might be unfamiliar to people who don't do C or Go any more. The C++ subculture is such that raw pointers seem to be considered taboo. Maybe this problem exists to separate the pointer-familiar sheep, from the pointer-less goats.

Perhaps the interviewer is supposed to be satisfied with a design, or a design and an implementation with some flaws that would get ironed out by a little testing.

Daily Coding Problem: Problem #644 [Easy]

A unival tree (which stands for "universal value") is a tree where all nodes under it have the same value.

Given the root to a binary tree, count the number of unival subtrees.

For example, the following tree has 5 unival subtrees:

Build and run Unival tree program

$ go build unival.go
$ .unival '(0 (1) (0 (1 (1) (1)) (0)))'

5 unival subtrees

Unival tree analysis

I think I see the 5 unival subtrees: 4 leaf nodes, which are vacuously unival trees, since all of their subtrees, of which there are zero, have the same value. The final unival tree is the all-1-value subtree.

The solution actually took me 1 decent false start, and some thinking. The solution is entirely recursive, although the problem statement hides it.

Leaf nodes have no children, therefore they have the same data value as their children. A leaf node is a unival tree. If the current node has the same value as its left and right children, and the child nodes are roots of unival subtrees, the current node and its subtrees form a unival subtree. You can do this in a post-order traverse, but there's a complicated decision about whether the current node is the root of a unival subtree.

I'm going to go with this problem is under-rated. It's at least a medium, given the complicated tests to decide if the current node heads a unival subtree. You've got to handle either or both child nodes don't exist, and you've got to decide if they exist, do the make the current node the root of a unival subtree. One child might be the root of a unival subtree, but have a different value than the current node.

The candidate could score points by noting that one child could be the root of a unival subtree, while the other isn't, so more testing is required to verify correctness.

Daily Coding Problem: Problem #651 [Medium]

This problem was asked by LinkedIn.

Determine whether a tree is a valid binary search tree.

A binary search tree is a tree with two children, left and right, and satisfies the constraint that the key in the left child must be less than or equal to the root and the key in the right child must be greater than or equal to the root.

Analysis

This particular problem statement seems sloppily worded. It reads as if they only want the candidate to consider 3-node trees. It also conflates "value of node's data" with "node itself". Sloppiness like that will lead to amazing bugs.

The other oddity in the problem statement is that it says:

the key in the left child must be less than or equal to the root and the
key in the right child must be greater than or equal to the root.

When building such a tree, you get to make a decision: does a value equal to that of the current node become the left child node's value, or the right child node's value?

I'm not even going to implement this one: it's garbage. This could be implemented by 2 if-then-else tests. There's no art or craft to this one.

The only reason to ask this question, if my reading is correct, is to see if a candidate can withstand ambiguity and do a really dumb task without complaining. Candidates who are asked to do this problem should reject that corporation's job offer. There's better environments to work in, don't take the job.

I note that some extremely sloppy web pages have similar definitions of binary search trees:

the key in each node of a BST is greater than or equal to any key stored in the left subtree and less than or equal to any key stored in the right subtree.

If you have to have duplicate keys in your trees, you need to decide something like "left tree is less than or equal to and right tree is greater than" so that you don't have to examine every node to find a key. If you don't have duplicate keys, make the criteria strictly less than and strictly greater than.

Daily Coding Problem: Problem #442 [Hard]

This problem was asked by Netflix.

A Cartesian tree with sequence S is a binary tree defined by the following two properties:

It is heap-ordered, so that each parent value is strictly less than that of its children.
An in-order traversal of the tree produces nodes with values that correspond exactly to S.

For example, given the sequence [3, 2, 6, 1, 9], the resulting Cartesian tree would be:

Given a sequence S, construct the corresponding Cartesian tree.

Analysis

There are several methods for constructing a Cartesian tree, none of them obvious.

I haven't done this yet.

Daily Coding Problem: Problem #422 [Easy]

This problem was asked by Salesforce.

Write a program to merge two binary trees. Each node in the new tree should hold a value equal to the sum of the values of the corresponding nodes of the input trees.

If only one input tree has a node in a given position, the corresponding node in the new tree should match that input node.

Analysis

My version of it requires 2 trees on the command line, each in a lisp-like syntax. The "-g" flag gives you GraphViz dot-language output, which can be converted into an image for viewing.

$ go build merge.o
$ ./merge -g '(1()(3(3)(3)))' '(1(2(2)(2))())'  > m.dot
$ dot -Tpng -o m.png m.dot
$ feh m.png # Or whatever your favorite viewer is

This makes a good interview problem for a candidate for a junior-level job. It's just nonsensical enough that nobody has done it on the job, but it requires some Computer Science and some experience.

The problem can be handled with standard binary-tree-recursive thinking. Realize that you need a recursive function, find the base case where recursion stops, handle the recursive call(s). There may be more than one base case, where a nil argument pointer gets passed in to avoid duplicate tests for nil where it handles recursive call(s).

The "merge" function takes two nodes (one from first tree, one from second tree) and returns a merged node.

If the merge function gets two nil nodes, it returns nil. This is the case where recursion stops.
If the merge functions get a non-nil and a nil node, it creates a new node with the value of the non-nil node, and calls the merge function on the non-nil node's children to fill in left and right children of the new node. Return the new node. This is 2 cases in the code, left tree nil, right tree non-nil, and vice versa.
If the merge function gets 2 non-nil nodes, create a new node with the sum of the input nodes' values, call the merge function on the left children of each node to create the left child of the new node. Call the merge function on the right children of each node to create the right child of the new node, Return the new node.

This does require a little insight to realize that the requirement for

If only one input tree has a node in a given position,
the corresponding node in the new tree should match that input node.

means chasing only the non-nil child. You don't have to write a CopyTree function, the merge function can take care of it.

The interviewer can watch for experience indicators, like having the merge function handle both nil arguments rather than having duplicated tests for non-nil in the code. The interviewer could elicit more from the candidate by asking for test cases, or if the candidate has missed a trick, suggesting a test case that triggers undesired behavior.

Daily Coding Problem: Problem #405 [Hard]

This problem was asked by Apple.

Given a tree, find the largest tree/subtree that is a BST.

Given a tree, return the size of the largest tree/subtree that is a BST.

Analysis

My version does this from the bottom up:

A nil pointer (it's in Go) is a zero-sized BST.
A leaf node is a 1-node-sized BST.
If a node's value is greater than it's left-child's value, and less than it's right-child's value, and both right and left sub-trees are BSTs, then the size of the BST is 1 + size of left-subtree + size of right-subtree.
It's possible for a node to not be the root of a Binary Search Tree, or one or the other children to not have the BST property. In that case, the code decides to use the largest of the 2 sub-tress that have the BST property.

I used Go's multiple returns from a function to indicate the size of the largest BST in the sub-tree, and whether or not the node is the root of a BST.

My faith in checking for nil pointers on entrance to the recursive function only grows. This lets you keep your code cleaner, without redundant left- and right-child checks for nil that visually clutter the code.

I think this algorithm runs in O(n) where n is the number of nodes in the whole tree. I don't think there's a better run time, given that the algorithm has to check every interior node's value against the values of the left and right child nodes.

As far as an interview problem goes, it's pretty good. A candidate can solve it using the usual recursive algorithm reasoning, finding a base case (nil pointer or leaf node) as the formal argument node, then working through what the recursive function has to do to provide the desired answer.

The recursive function has to watch for a few cases, like one child pointer nil, the other non-nil, and the cases of current node as root of BST vs not root of BST.

My solution assumes that the interviewer wants to find binary search trees that go all the way to the leaf nodes. It's possible that the interviewer wanted a BST that encompasses only part of the entire tree. That would be a harder problem, but the candidate could do some elaborate checking on left and right child values even if they don't comprise roots of BSTs themselves. This seems kind of nonsensical, though.

It looks to me like the interviewer could watch a candidate reason out the recursive function, especially the "one nil, one non-nil child node" cases to see if the candidate has a good grasp of algorithms. Solving this problem requires enough code to ensure that the candidate can actually write programs in the language in question. Not many candidates will have run into the problem of finding sub-trees that have the BST property at work, so nobody has a memorized solution.

Compared to other "hard" problems, this one really isn't. I'd rate it just a "medium".

Daily Coding Problem: Problem #664 [Easy]

This problem was asked by Google.

Given a binary tree of integers, find the maximum path sum between two nodes. The path must go through at least one node, and does not need to go through the root.

Analysis

I haven't done this one yet.

Daily Coding Problem: Problem #702 [Medium]

This problem was asked by Google.

Given the root to a binary tree, implement serialize(root), which serializes the tree into a string, and deserialize(s), which deserializes the string back into the tree.

For example, given the following Node class

class Node:
    def __init__(self, val, left=None, right=None):
        self.val = val
        self.left = left
        self.right = right

The following test should pass:

node = Node('root', Node('left', Node('left.left')), Node('right'))
assert deserialize(serialize(node)).left.left.val == 'left.left'

Analysis

"Medium"!?! The serializing isn't particularly difficult, but deserializing any Lisp S-expression type representation will be tough. If you allow whitespace, it becomes a lot more difficult. Part of the difficulty of this problem is designing the serialized text representation.

My tree package implements this in create.go. Specifically, functions tree.CreateFromString, tree.CreateNumericFromString do the deserialization, and function tree.Printf does the serialization. These functions get used in many of the other problem solutions.

My code is in Go, not Python as above, but it meets the problem statement. The utility program to create GraphViz output of a tree almost meets the problem statement.

As an interview question this might not be bad. The interviewer gets to see a lot of coding for the de-serialization part. The candidate can note where corner cases like "Missing ')'" might cause problems. A node with a right child, but no left child is also a test case. This would be a harsh problem for whiteboarding - it's harder than many other "[Hard]" coding problems.

Daily Coding Problem: Problem #748 [Easy]

This problem was asked by Apple.

Given the root of a binary tree, find the most frequent subtree sum. The subtree sum of a node is the sum of all values under a node, including the node itself.

For example, given the following tree:

  5
 / \
2  -5

Return 2 as it occurs twice: once as the left leaf, and once as the sum of 2 + 5 - 5.

Analysis

Code

This is easily done recursively. A leaf node's sum is its value. An interior node's sum is its value plus the values of the left and right subtrees, if either. The tricks are:

Stop recursing at a leaf node.
Get the subtree sums to some place where you can figure out which one occurs most often. This is a little tricky for recursive functions. It requires either a global collector of sums, or extra arguments to the recursive function.
Realize that the current node's subtree sum gets used as both a child's subtree sum, and its own subtree sum

Since I wrote this program in Go, I chose to send a chan int in the arguments of the recursive function. Each invocation of the function checks to see if it should top recursing. If not, recurse on the left child, which returns the subtree sum, and recurse on the right child, which also returns a subtree sum. Add the current node's value to the subtree sums, write that sum to the pipe, and return it.

If the recursive function runs in its own goroutine, the main thread can just collect subtree sums from the channel, and count them. The recursive function's return value gets used by callers, the channel gets used to count subtree sums.

Interview Analysis

Whoever conceived of and used this as an interview problem was interested in seeing some algorithmic thinking. It's basically a post-order binary tree traversal, plus keeping track of, and sorting, subtree sums.

I'll go with the "[Easy]" rating, but note that it's a decent problem for a whiteboard. The interviewer can indeed see some algorithmic thinking, and possibly a little program design. The way I did it is particular to Go, but it could be done in C or Java or Python with only a little modification. The candidate can exhibit coding skills and design skills, and show that they can reason about correct results.

Daily Coding Problem: Problem #750 [Medium]

This problem was asked by Jane Street.

Generate a finite, but an arbitrarily large binary tree quickly in O(1).

That is, generate() should return a tree whose size is unbounded but finite.

Analysis

Code

I was unsatisfied with this program. After writing it, re-reading the problem statement makes me think my solution isn't what they want at all. Setting the number of nodes in the "random" tree might not be what the interviewer desires.

Beyond that, what is the "O(1)" associated with? For sorting, the number of swaps is important. For making a bunch of "random" things you have to generate a bunch of at least pseudo-random numbers, maybe to have randomly-valued nodes in the binary tree, or maybe to decide which branch of a node to add any further nodes.

If we use "choosing a random number" as the operation we track O() for, my program misses the goal. My program creates O(n) random numbers for node values. It also creates O(log₂n) random numbers to decide how to partition an array of node values into sub-trees. I failed this interview.

Around the web

Stackoverflow solutions
Daily Coding Problem blog solutions

The consensus appears to be "generate the tree lazily". That is, node-generation happens if and when a node gets accessed. This strikes me as cheating.

Interview Analysis

Are you willing to cheat to pass an interview? Maybe that's a good thing for something like a pen testing position, but outside of that, I don't know. I would be unlikely to come up with lazy tree generation as a solution during an interview. Maybe I could find it if this was a take-home problem.

In any case, lazy generation seems like it's very vaguely gestured to by the problem statement. If the candidate should decide on lazy generation, they should probably ask questions of the interviewer to see if that's what's desired.

It's entirely possible that the question is deliberately loosely phrased to encourage candidates to ask questions, allowing the interviewer to see how a candidate thinks.

Daily Coding Problem: Problem #751 [Hard]

Also, Daily Coding Problem: Problem #808 [Hard]

This problem was asked by Palantir.

Typically, an implementation of in-order traversal of a binary tree has O(h) space complexity, where h is the height of the tree. Write a program to compute the in-order traversal of a binary tree using O(1) space.

Analysis

This question wants a "Morris traversal".

Interview Analysis

Daily Coding Problem: Problem #793 [Medium]

This problem was asked by Yahoo.

Recall that a full binary tree is one in which each node is either a leaf node, or has two children. Given a binary tree, convert it to a full one by removing nodes with only one child.

For example, given the following tree:

         0
      /     \
    1         2
  /            \
3                 4
  \             /   \
    5          6     7

You should convert it to:

     0
  /     \
5         4
        /   \
       6     7

Analysis

Code

This can be solved recursively, by removing single-children from the bottom up. Recurse to leaf nodes, then on the way back up the tree, return the node itself if it has either no children (leaf) or both children. Otherwise return the non-nil child node, or nil if the node is a leaf node. The recursive function will look like it's resetting both left and right child node pointers, but at max, only one gets reset.

There's room to screw up by not freeing the node's pointer if it has only one child and you're working in a language without garbage collection, like ANSI C.

Interview Analysis

Why is is one "[Medium]", yet #748 (count of subtrees' sums) rated "[Easy]"?

This is at core a post-order traversal, where all the "work" gets done post-order. There's a wee bit of insight to see that the recursive function has to reset both left and right child pointers, and there's a tricky little bit of coding in deciding what to return, but it's not harder than #748.

It might actually make a decent whiteboarding problem because it doesn't require too many lines of code, but does require some thinking.

Daily Coding Problem: Problem #307

This problem was asked by Oracle.

Given a binary search tree, ind the floor and ceiling of a given integer. The floor is the highest element in the tree less than or equal to an integer, while the ceiling is the lowest element in the tree greater than or equal to an integer.

If either value does not exist, return None.

Analysis

I haven't done this one yet.

Interview Analysis

Daily Coding Problem: Problem #36

Also: Daily Coding Problem: Problem #992 [Medium]

This problem was asked by Dropbox.

Given the root to a binary search tree, find the second largest node in the tree.

Analysis

The largest valued node in a binary search tree is the right-most node. The second-largest-valued node depends on the shape of the tree.

    1        2        1
   / \      /          \
  0   2    1            2

The largest valued node in the above 3 trees has the value 2, but the second-largest-value can be its parent or its left child.

My code prints the largest and second-largest values.

I did this problem a second time, because I didn't remember doing it a first time. My second attempt is quite close to my first attempt.

The candidate would probably want to point out that test cases for all 3 tree shapes would be good. Asking about a single-node tree would be good. Technically a single-node tree is a binary search tree, but it wouldn't have a second largest value.

Interview Analysis

This is relatively easy for a Daily Coding Problem "[Medium]" problem.

The problem statement is problematic. "Second largest node in the tree" confuses "node" with "value of node's data". So what does "find the second largest node" mean? I'm sure most people will find the 2nd-largest-value of nodes in the tree, but it's certainly possible to decide that the "second largest node" is the parent of the 2nd-largest population subtree. Does "find the node" mean return a pointer to that node, or returning the second-largest-value of nodes in the tree, or just printing out the second-largest-value as I did?

This problem statement should be re-phrased to ask for something less ambiguous. An interviewer asking this question will get a lot of clarifying questions from some candidates, and some shocking code from a few candidates that both think unconventionally, and are either afraid to ask questions, or are so sure of themselves that they just do some bizarro thing like figure out which immediate child of the root node has the biggest tree.

Of the more conventionally-minded candidates, or those candidates that understand the distinction between "data" and "data structure", this problem could potentially weed out those that don't know about binary trees.

You could accomplish finding the second-largest-value several ways:

Traverse tree to find largest value. Do second traverse to find second largest value.
Traverse tree in-order to produce a sorted array or list, find second largest value from the sorted array.
Some clever recursive solution, which is what I did. Twice.

One lesson in this one for candidates is: ask clarifying questions. Do you understand what "largest node" means? What does "find the node" mean - return the node, return the value, or just output something when the code receives that node as argument?

Another lesson is that these coding problems aren't always well thought out, or even able to help an interviewer ascertain your skills.

Daily Coding Problem: Problem #936 [Medium]

This problem was asked by Google.

Given a binary search tree and a range [a, b] (inclusive), return the sum of the elements of the binary search tree within the range.

For example, given the following tree:

and the range [4, 9], return 23 (5 + 4 + 6 + 8).

Analysis

I haven't done this one yet.

Interview Analysis

Daily Coding Problem: Problem #1020 [Easy]

This problem was asked by Google.

Given the root of a binary search tree, and a target K, return two nodes in the tree whose sum equals K.

For example, given the following tree and K of 20

Return the nodes 5 and 15.

Analysis

I haven't done this one yet.

The statement says "binary search tree", so instead of traversing the whole tree to find two nodes, it might pay off to use the BST property with some node's value to find K - value, if it exists. If it doesn't exist, move to another node. Use the BST property to decide which subtree to move to.

Interview Analysis

Daily Coding Problem: Problem #1057 [Easy]

This problem was asked by Amazon.

Given an integer N, construct all possible binary search trees with N nodes.

Code

Analysis

The wording is confusing. "Binary search trees" means that the nodes in a binary tree have values that can be compared, and that the nodes appear in the binary trees according to that comparison. You almost have to assume that by "N nodes", whoever wrote and used this question meant "N unique valued nodes". If you allow BSTs with duplicate value nodes, you can't distinguish certain trees.

I'm assuming that "N nodes" means "N nodes with unique integer values".

If you construct BSTs by inserting values, the shape or configuration of the resulting tree depends on the order of the insertion. Even simple, 2-value trees, with nodes of value 1 and 2, can have different shapes:

Inserting values 1 and 2 in that order gives you a tree like this:

    1
     \
      2

Inserting values 1 and 2 in the order 2, 1 gives you a tree like this:

    2
   /
  1

My solution finds all permutations of an array of N integers, values 1 through N, then constructs BSTs by inserting the array values in order. This should find all valid BSTs for N unique integers, because you can certainly order N integers and then map keys to them with values 1 through N.

I used [Heap's Algorithm] to permute the array, constructing BSTs when Heap's algorithm recursion bottoms out. There are duplicate BSTs, so it can't just print a string representation of the BST. I have the code putting a string representation of the BSTs it constructs on a channel, and another goroutine outputting only unique string representations.

This is of O(N!) time complexity because of the use of permutations to find all BSTs.

Interview Analysis

I do not see how this is "[Easy]". I'd call it more like "[Hard]".

At the very least, the candidate who completes this has to have familiarity with binary trees, and what "binary search tree" means. Doing it with my O(N!) algorithm, you have to know a lot more, Heap's algorithm isn't blatantly obvious.

If the candidate has done this sort of thing before, they know that the Nth Catalan Number is the count of number of BSTs containing N unique keys. This fact can be used to check if a program returns the right number of BSTs. Since the count of distinct binary trees is a Catalan number, this means that all binary tree shapes can be generated by inserting N nodes in particular orders.

Daily Coding Problem: Problem #1072 [Easy]

This problem was asked by Facebook.

Given a binary tree, return the level of the tree with minimum sum.

Analysis

The problem seems poorly phrased: it asks the candidate to assume numerically-valued nodes, but are they integer values or floating point? Summing floating point numbers of radically varying magnitude can have unintuitive results.

I'll assume integer value data of tree nodes.

This could be done more than one way:

Breadth-first traverse of the tree. I suppose you could keep track of the minimum sum of previous levels' nodes and quit doing work if the current level summed higher than the current minimum sum.
Depth-first recursive traverse of the entire tree, tracking the level via another argument to the recursive function. You'd have to pass around a data structure of some type that would keep a sum of each level encountered.
If you wrote the program in Go, you could have a goroutine traverse the tree, writing a struct with level and node value for each node it encounters. The main goroutine could keep sums keyed by level.

I chose to organize the task using (3), but I wrote 2 programs.

I chose to use a Go map[int]int for my first program where the level is key (level of root node is zero), and the value matching that key is the sum of all nodes' values at that level.

I chose to use a slice of integers, Go []int type, for my second program. This takes advantage of the observation that the maximum level encountered is only 1 greater than the maximum level known before. This arises from the structure of binary trees.

Interview Analysis

"Easy" is a good rating for this problem, which constitutes knowing what a binary tree is, knowing how to program a traverse of a tree, and being able to select or design a data structure to keep track of the sums of nodes' values by level.

I think that's about all that a candidate can show, except maybe by asking questions about the problem statement. It's definitely all that an interviewer can expect, without asking further questions about the problem.

Generate all binary tree "shapes"

I haven't see this as a Daily Coding Problem, but it seems like a question that could come up.

I would phrase it as: calculate all the possible shapes (or configurations, or topologies) of a binary tree with N nodes.

Code

Analysis

There's several ways to do this, but I couldn't figure anything out on my own. I used an algorithm I found on stack exchange.

Interview Analysis

I would definitely have a lot of trouble with this problem in an interview, so I rate it "[Hard]", and I hope it never comes up in one of my job interviews.

Turn a tree into Wikipedia's "succinct" representation

This isn't a problem I've ever seen come up, but it seems like something that could be an interview problem.

A succinct encoding of a binary tree is one which occupies close to minimum possible space, as established by information theoretical lower bounds. The number of different binary trees of n nodes is C_n, the nth Catalan number (assuming we view trees with identical structure as identical). For large n this is about 4ⁿ, thus we need at least about log₂(4 n) = 2n bits to encode it. A succinct binary tree's shape therefore would occupy 2n bits.

One simple representation which meets this bound is to visit the nodes of the tree in preorder, outputting "1" for an internal node and "0" for a leaf, This ignores nodes' data, representing only the trees' shapes.

Code

Interview Analysis

My phrasing of the problem statement may be overly specific, keeping candidates from demonstrating any algorithmic insight.

"Leaf" node in this case is actually the null/nil pointer value of a data carrying node. What everyone else on earth would call a leaf node has two of these "leaf" node children, in the form of non-valued-pointers. That unusual terminology may spoil this as a good interview question.

Daily Coding Problem: Problem #1107 [Hard]

This problem was asked by Adobe.

You are given a tree with an even number of nodes. Consider each connection between a parent and child node to be an "edge". You would like to remove some of these edges, such that the disconnected subtrees that remain each have an even number of nodes.

For example, suppose your input was the following tree:

In this case, removing the edge (3, 4) satisfies our requirement.

Write a function that returns the maximum number of edges you can remove while still satisfying this requirement.

Analysis

I haven't done this yet.

LC 114 [Medium] Flatten Binary Tree to Linked List

Question: Given a binary tree, flatten it to a linked list in-place.

For example, given the following tree:

The flattened tree should look like:

Analysis

This problem loosely describes taking a binary tree as input and returning a linked list with numeric data items in pre-order traversal, The "right child" pointer of each tree node refers to the next list node.

The example is (deliberately?) confusing. The binary tree given is almost a Binary Search Tree, while the linked list has nodes with numeric data in ascending order. You might get baffled by the example, and link the nodes by in-order-traversal of the tree. Maybe that's what the problem statement really wants, and the example is wrong: it's harder to convert a pre-order traverse into a linked list in order. I'm going to assume they want a pre-order-traverse-linked-list.

The problem statement says to flatten the tree "in-place", which presumably rules out a pre-order traversal with a slice or array of tree elements that your program fills in, then walking the array to get linking pointers correct.

I did this five ways:

Purely recursive
Go-style using casual concurrency
Iteratively, using an explicit stack for the pre-order traverse
Different recursion, from here
Iteratively without a stack, from here

All programs pass around pointers with abandon. The recursive and iterative programs might as well be written in C, but the Go-style, casually concurrent version, could not be. The casually concurrent version has clearer recursion, It does have about 30% more lines than the recursive version, so this may be a wash.

Recursive and concurrent versions use the call stack to hold left and right child pointers of a tree node so that an arbitrary nodes Left and Right data elements can be set to nil or used as the "next" linked list pointer. This is key to getting the linked list in pre-order-traverse-order. If those pointer values don't get saved on the call stack, some other explicit stack would have to exist to hold them, as illustrated by the iterative version. Does either of the recursive or iterative solutions count as "in place" if they use the call stack to keep track of left/right child pointers and next list pointers?

Interview Analysis

"Medium" might constitute a correct rating for this problem, but I think you could argue for "Hard". The candidate has to know pre-order traverse, and get some oddly tricky link-pointer manipulation correct. The problem deals with two different data structures. The problem statement and example seem less than straightforward.

If I go this as an interview question, I would talk though my design, noting the places I wasn't sure of, or believed that experimentation would let me get correct.

Test cases would be important to note: 1-element, complete binary tree, all-left-branch binary tree inputs would be good tests. Testing some arrangement of data values to prove pre-order-traversal would be good.

The interviewer who posed this question should be ashamed of themselves. It's seems deliberately misleading. The example doesn't really add any information.

Name		Name	Last commit message	Last commit date
Latest commit History 301 Commits
tree		tree
README.md		README.md
all_bst.go		all_bst.go
all_trees.go		all_trees.go
bottomview.go		bottomview.go
bous.go		bous.go
breadthfirst.go		breadthfirst.go
bst_property.go		bst_property.go
bst_second.go		bst_second.go
bst_value_sum.go		bst_value_sum.go
countnodes.png		countnodes.png
cousins.go		cousins.go
cousins2.go		cousins2.go
createheap.go		createheap.go
drawtree.go		drawtree.go
full_tree.go		full_tree.go
go.mod		go.mod
ht_balanced.go		ht_balanced.go
inorder_successor.go		inorder_successor.go
inorder_successor.png		inorder_successor.png
invert.go		invert.go
invert2.go		invert2.go
invertnode.go		invertnode.go
largest_bst.go		largest_bst.go
lca.go		lca.go
level_sum1.go		level_sum1.go
level_sum2.go		level_sum2.go
list_from_tree.go		list_from_tree.go
list_from_tree2.go		list_from_tree2.go
list_from_tree3.go		list_from_tree3.go
list_from_tree4.go		list_from_tree4.go
list_from_tree5.go		list_from_tree5.go
locktree.go		locktree.go
makefile		makefile
merge.go		merge.go
merge.png		merge.png
minht.png		minht.png
minimal_ht_tree.go		minimal_ht_tree.go
minimal_ht_tree2.go		minimal_ht_tree2.go
minpathsum.go		minpathsum.go
node_access_count.png		node_access_count.png
node_accesses.png		node_accesses.png
nread.go		nread.go
numberedtree.png		numberedtree.png
order.go		order.go
prune_tree.go		prune_tree.go
rand.go		rand.go
rando1.go		rando1.go
readtree.go		readtree.go
reconstruct.go		reconstruct.go
reconstruct2.go		reconstruct2.go
runtests		runtests
search.go		search.go
secondlargest.go		secondlargest.go
serialize.go		serialize.go
subtree_sum.go		subtree_sum.go
succint.go		succint.go
testmin.go		testmin.go
testmin.load		testmin.load
traverse.go		traverse.go
tree_count.go		tree_count.go
tree_count3.go		tree_count3.go
tree_depth.go		tree_depth.go
tree_depth2.go		tree_depth2.go
tree_depth_chan.go		tree_depth_chan.go
tree_depth_node.go		tree_depth_node.go
tree_paths.go		tree_paths.go
unival.go		unival.go

bediger4000/binary_tree

Folders and files

Latest commit

History

Repository files navigation

Binary Tree Puzzles and Interview Questions

Building

Questions and programs

Daily Coding Problem: Problem #842 [Medium]

Daily Coding Problem: Problem #752 [Easy]

Analysis

Daily Coding Problem: Problem #622 [Easy]

Analysis

Reconstruct a tree from traversals

Analysis

Interview Analysis

Return all paths from the root to leaves

Analysis

Daily Coding Problem: Problem #540 [Easy]

Analysis

Cousin Nodes #1

Daily Coding Problem: Problem #487 [Medium]

Analysis

Cousin Nodes #2

Analysis

Prune a tree so that subtrees containing all 0s are removed

Analysis

Minimum height binary search tree

Analysis

Interview Analysis

Daily Coding Problem: Problem #545 [Hard]

Analysis

Count nodes in less than linear time

Incorrect Algorithm Analysis 1

Another Algorithm Analysis

Interview Analysis

Daily Coding Problem: Problem #502 [Easy]

Analysis

Daily Coding Problem: Problem #133

Analysis

Daily Coding Problem: Problem #490 [Medium]

Analysis

Daily Coding Problem: Problem #490 [Medium]

Analysis

Daily Coding Problem: Problem #482 [Medium]

Analysis

Daily Coding Problem: Problem #475 [Medium]

Analysis

Daily Coding Problem: Problem #644 [Easy]

Build and run Unival tree program

Unival tree analysis

Daily Coding Problem: Problem #651 [Medium]

Analysis

Daily Coding Problem: Problem #442 [Hard]

Analysis

Daily Coding Problem: Problem #422 [Easy]

Analysis

Daily Coding Problem: Problem #405 [Hard]

Analysis

Daily Coding Problem: Problem #664 [Easy]

Analysis

Daily Coding Problem: Problem #702 [Medium]

Analysis

Daily Coding Problem: Problem #748 [Easy]

Analysis

Interview Analysis

Daily Coding Problem: Problem #750 [Medium]

Analysis

Around the web

Interview Analysis

Daily Coding Problem: Problem #751 [Hard]

Analysis

Interview Analysis

Daily Coding Problem: Problem #793 [Medium]

Analysis

Interview Analysis

Daily Coding Problem: Problem #307

Analysis

Interview Analysis

Daily Coding Problem: Problem #36

Packages