How do I call a program in C, setting up standard pipes?

Earlier I showed how to call a program in C. This works by first forking the current process, then replacing the child process with the new program image. But how do we talk to this new process? We must set up the new process’s standard pipes (standard in, out, error). This is a fair bit of work.

Here’s the original call function, which starts a new process but does not set up its pipes (or do any error checking):

pid_t call(char* argv[]) {
  pid_t pid = fork();
  if (pid == 0) {
    char* envp[] = { NULL };
    execve(argv[0], argv, envp);
  } else {
    return pid;
  }
}

Our updated function will have the following new signature, writing the new process’s information into a struct. The struct contains descriptors for the write-end of the subprocess’s standard-in pipe (so we can write to the subprocess), and descriptors for the read-ends of the subprocess’s standard-out and standard-error pipes (so we can read back from the subprocess).

struct subprocess {
  pid_t pid;
  int stdin;
  int stdout;
  int stderr;
};

void call(char* argv[], struct subprocess * p) {
  // ...
}

We’ll call the function like this:

struct subprocess proc;
call(argv, &proc);

Before forking, we must create the new pipes. We create three new pipes, which we’ll later connect up to the expected descriptors.

void mk_pipe(int fds[2]) {
  if (pipe(fds) == -1) { perror("Could not create pipe"); exit(1); }
}

void call(char* argv[], struct subprocess * p) {
  int child_in[2]; int child_out[2]; int child_err[2];
  mk_pipe(child_in); mk_pipe(child_out); mk_pipe(child_err);
  pid_t pid = fork();
  // ...
}

After forking, the file descriptor table is cloned. This means both processes have the same references to every pipe. Recap which pipes we have: those for the parent process (its standard in, out, and error) and those for the child (the ones we just created). Each of those pipes has a read end and a write end. After forking, here is how both processes can refer to those pipe ends:

pipe end	parent	child
parent stdin (read)	`0`	`0`
parent stdout (write)	`1`	`1`
parent stderr (write)	`2`	`2`
child stdin (read)	`stdin[0]`	`stdin[0]`
child stdin (write)	`stdin[1]`	`stdin[1]`
child stdout (read)	`stdout[0]`	`stdout[0]`
child stdout (write)	`stdout[1]`	`stdout[1]`
child stderr (read)	`stderr[0]`	`stderr[0]`
child stderr (write)	`stderr[1]`	`stderr[1]`

start state

What a mess! This needs some reshuffling:

The parent wants to access its pipe ends via the proc struct; not via the tuple arrays.
The child wants to access its pipe ends via standard file descriptors 0, 1 and 2; not via the tuple arrays.
The child should not have access to the parent’s standard pipes.
Each pipe end should only have one reference; not two. We must close the non-owner’s references.

It should end up like this:

pipe end	parent	child
parent stdin (read)	`0`	-
parent stdout (write)	`1`	-
parent stderr (write)	`2`	-
child stdin (read)	-	`0`
child stdin (write)	`proc.stdin`	-
child stdout (read)	`proc.stdout`	-
child stdout (write)	-	`1`
child stderr (read)	`proc.stderr`	-
child stderr (write)	-	`2`

desired state

Let’s start by closing some descriptors. Each end of a new pipe should only be referenced by one process. To remove references to a pipe end, we call close, like this:

void call(char* argv[], struct subprocess * p) {
  int child_in[2]; int child_out[2]; int child_err[2];
  mk_pipe(child_in); mk_pipe(child_out); mk_pipe(child_err);
  pid_t pid = fork();
  if (pid == 0) {
    close(0); close(1); close(2);                                 // close parent pipes
    close(child_in[1]); close(child_out[0]); close(child_err[0]); // unused child pipe ends
    // ...
  } else {
    close(child_in[0]); close(child_out[1]); close(child_err[1]); // unused child pipe ends
    // ...
  }
}

After closeing appropriate ends in each process, we end up with:

pipe end	parent	child
parent stdin (read)	`0`	-
parent stdout (write)	`1`	-
parent stderr (write)	`2`	-
child stdin (read)	-	`stdin[0]`
child stdin (write)	`stdin[1]`	-
child stdout (read)	`stdout[0]`	-
child stdout (write)	-	`stdout[1]`
child stderr (read)	`stderr[0]`	-
child stderr (write)	-	`stderr[1]`

after closing

Much better. The remaining problems are that the child pipes are not referenced in the right way. The parent wants to access its pipe ends via the proc struct. To fix this, we copy the descriptors to that struct, and forget the tuple arrays.

parent moved

The child’s side is trickier. The big problem is that the child’s standard descriptors (0, 1, and 2) point to the parent pipes, but we want them to point to the newly created child pipes. In other words, we want to move the reference from, say, child_in[0] to 0.

In UNIX, we move a file descriptor by duplicating it, then closing the old one. The system call is dup2:

#include <unistd.h>
int dup2(int fildes, int fildes2);

After calling dup2(fd1, fd2), the resource previously referenced by fd1 is now also referenced by fd2. Here are the references after duplicating them:

after dup2

Finally, we close the old descriptors:

void mv_fd(int fd1, int fd2) {
  dup2(fd1, fd2);
  close(fd1);
}

Let’s call mv_fd to place the child’s pipes in the standard locations:

  // ...
  if (pid == 0) {
    close(0); close(1); close(2);                                 // close parent pipes
    close(child_in[1]); close(child_out[0]); close(child_err[0]); // unused child pipe ends
    mv_fd(child_in[0], 0); mv_fd(child_out[1], 1); mv_fd(child_err[1], 2); // copy new fds to standard locations
    // ...
  }
  // ...
}

after move

Done! When we call execve, the page table for the child process will be replaced, so we can forget it. Here’s the full code for call:

// Start program at argv[0] with arguments argv.
// Set up new stdin, stdout and stderr.
// Puts references to new process and pipes into `p`.
void call(char* argv[], struct subprocess * p) {
  int child_in[2]; int child_out[2]; int child_err[2];
  pipe(child_in); pipe(child_out); pipe(child_err);
  pid_t pid = fork();
  if (pid == 0) {
    close(0); close(1); close(2);                                 // close parent pipes
    close(child_in[1]); close(child_out[0]); close(child_err[0]); // unused child pipe ends
    mv_fd(child_in[0], 0); mv_fd(child_out[1], 1); mv_fd(child_err[1], 2);
    char* envp[] = { NULL };
    execve(argv[0], argv, envp);
  } else {
    close(child_in[0]); close(child_out[1]); close(child_err[1]); // unused child pipe ends
    p->pid = pid;
    p->stdin = child_in[1];   // parent wants to write to subprocess child_in
    p->stdout = child_out[0]; // parent wants to read from subprocess child_out
    p->stderr = child_err[0]; // parent wants to read from subprocess child_err
  }
}

For simplicity, the code above does no error checking after system calls. You might want to use the following instead of the raw system calls:

void close(int fd) {
  if (close(fd) == -1) { perror("Could not close pipe end" ); exit(1); }
}

void mk_pipe(int fds[2]) {
  if (pipe(fds) == -1) { perror("Could not create pipe"); exit(1); }
}

void mv_fd(int fd1, int fd2) {
  if (dup2(fd1,  fd2) == -1) { perror("Could not duplicate pipe end"); exit(1); }
  close(fd1);
}

Tagged #c, #programming, #pipes, #posix, #fave.

More by Jim

What does the dot do in JavaScript?

foo.bar, foo.bar(), or foo.bar = baz - what do they mean? A deep dive into prototypical inheritance and getters/setters. 2020-11-01

Smear phishing: a new Android vulnerability

Trick Android to display an SMS as coming from any contact. Convincing phishing vuln, but still unpatched. 2020-08-06

A probabilistic pub quiz for nerds

A “true or false” quiz where you respond with your confidence level, and the optimal strategy is to report your true belief. 2020-04-26

Time is running out to catch COVID-19

Simulation shows it’s rational to deliberately infect yourself with COVID-19 early on to get treatment, but after healthcare capacity is exceeded, it’s better to avoid infection. Includes interactive parameters and visualizations. 2020-03-14

The inception bar: a new phishing method

A new phishing technique that displays a fake URL bar in Chrome for mobile. A key innovation is the “scroll jail” that traps the user in a fake browser. 2019-04-27

The hacker hype cycle

I got started with simple web development, but because enamored with increasingly esoteric programming concepts, leading to a “trough of hipster technologies” before returning to more productive work. 2019-03-23

Project C-43: the lost origins of asymmetric crypto

Bob invents asymmetric cryptography by playing loud white noise to obscure Alice’s message, which he can cancel out but an eavesdropper cannot. This idea, published in 1944 by Walter Koenig Jr., is the forgotten origin of asymmetric crypto. 2019-02-16

How Hacker News stays interesting

Hacker News buried my post on conspiracy theories in my family due to overheated discussion, not censorship. Moderation keeps the site focused on interesting technical content. 2019-01-26

My parents are Flat-Earthers

For decades, my parents have been working up to Flat-Earther beliefs. From Egyptology to Jehovah’s Witnesses to theories that human built the Moon billions of years in the future. Surprisingly, it doesn’t affect their successful lives very much. For me, it’s a fun family pastime. 2019-01-20

The dots do matter: how to scam a Gmail user

Gmail’s “dots don’t matter” feature lets scammers create an account on, say, Netflix, with your email address but different dots. Results in convincing phishing emails. 2018-04-07

The sorry state of OpenSSL usability

OpenSSL’s inadequate documentation, confusing key formats, and deprecated interfaces make it difficult to use, despite its importance. 2017-12-02

I hate telephones

I hate telephones. Some rational reasons: lack of authentication, no spam filtering, forced synchronous communication. But also just a visceral fear. 2017-11-08