Replace multiple spaces with one space in a string

Question

How would I do something in C++ similar to the following code:

//Lang: Java
string.replaceAll("  ", " ");

This code snippet is meant to replace every run of multiple spaces in a string with a single space.

Possible duplicate of "Interview Question: Trim multiple consecutive spaces from a string"
– Cubbi, commented Dec 2, 2011 at 20:54

diralik · Accepted Answer · 2018-01-17 22:54:37Z

85

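#include <algorithm>  // std::unique
#include <string>

// True only when both adjacent characters are spaces.
bool BothAreSpaces(char lhs, char rhs) { return (lhs == rhs) && (lhs == ' '); }

// Given a std::string str: collapse each run of spaces, then erase the leftover tail.
std::string::iterator new_end = std::unique(str.begin(), str.end(), BothAreSpaces);
str.erase(new_end, str.end());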

How this works: std::unique has two forms. The first form goes through a range and removes adjacent duplicates, so the string "abbaaabbbb" becomes "abab". The second form, which I used, takes a predicate that accepts two elements and returns true if they should be considered duplicates. The function I wrote, BothAreSpaces, serves this purpose. It determines exactly what its name implies: that both of its parameters are spaces. So when combined with std::unique, duplicate adjacent spaces are removed.
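As a rough illustration of the two forms described above, here is a minimal sketch. The helper names collapse_all_runs and collapse_space_runs are made up for this example and are not part of the answer; the lambda is just an inline equivalent of BothAreSpaces.

```cpp
#include <algorithm>
#include <string>

// Hypothetical helper: first form of std::unique, which collapses every
// run of equal adjacent characters.
std::string collapse_all_runs(std::string s) {
    s.erase(std::unique(s.begin(), s.end()), s.end());
    return s;
}

// Hypothetical helper: predicate form, which treats two adjacent
// characters as duplicates only when both are spaces.
std::string collapse_space_runs(std::string s) {
    auto both_spaces = [](char lhs, char rhs) { return lhs == ' ' && rhs == ' '; };
    s.erase(std::unique(s.begin(), s.end(), both_spaces), s.end());
    return s;
}
```

Calling collapse_all_runs("abbaaabbbb") gives "abab", while collapse_space_runs leaves repeated letters alone and only shortens runs of spaces.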

Just like std::remove and remove_if, std::unique doesn't actually make the container smaller, it just moves elements at the end closer to the beginning. It returns an iterator to the new end of range so you can use that to call the erase function, which is a member function of the string class.

Breaking it down, the erase function takes two parameters: a begin and an end iterator for the range to erase. For its first parameter I'm passing the return value of std::unique, because that's where I want to start erasing. For its second parameter, I am passing the string's end iterator.
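To make the "unique does not shrink the container" point concrete, the sketch below records the string's size at each step. The Sizes struct and the collapse_in_place function are hypothetical names introduced only for this illustration.

```cpp
#include <algorithm>
#include <cstddef>
#include <string>

bool BothAreSpaces(char lhs, char rhs) { return (lhs == rhs) && (lhs == ' '); }

// Hypothetical record of the string's size at each step.
struct Sizes {
    std::size_t before;
    std::size_t after_unique;
    std::size_t after_erase;
};

// Hypothetical wrapper around the unique + erase idiom from the answer.
Sizes collapse_in_place(std::string& str) {
    Sizes s{};
    s.before = str.size();
    // std::unique only shifts the kept characters toward the front.
    std::string::iterator new_end = std::unique(str.begin(), str.end(), BothAreSpaces);
    s.after_unique = str.size();  // still the original size
    // erase is what actually shortens the string.
    str.erase(new_end, str.end());
    s.after_erase = str.size();
    return s;
}
```

For an input such as "a   b  c", the size stays at 8 after std::unique and only drops to 5 once erase has run.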

edited Jan 17, 2018 at 22:54

diralik


answered Dec 2, 2011 at 20:20

Benjamin Lindley


13 Comments

Seth Carnegie Over a year ago

Cool, never seen this before. +1

Benjamin Lindley Over a year ago

@Seth: Neither have I, it just came to me suddenly.

Seth Carnegie Over a year ago

You could even do template<char Remove> bool BothAre(char lhs, char rhs) { return lhs == rhs && lhs == Remove; } then str.erase(std::unique(str.begin(), str.end(), BothAre<' '>), str.end()); to make it a tiny bit generic and usable for other characters too

Seth Carnegie Over a year ago

@user386911 std::unique moves all consecutive duplicate characters in between the two iterators it receives to the end iterator, so that all the characters end up at the end of the string. It then returns the iterator to the beginning of all the characters it moved to the end of the string, and you pass that iterator to str.erase which takes two iterators and removes all the characters between them. tl;dr: all the duplicate spaces end up at the end of the string via unique, then erase removes them.

Benjamin Lindley Over a year ago

@Seth: "all the characters end up at the end of the string" <-- Common myth. Neither std::unique nor std::remove is required to do this, and I'm not aware of any implementation where they do. They just copy or move the non-duplicate elements from the end toward the front.
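The template variant suggested in the comments above could be wrapped up roughly as follows. BothAre comes from the comment; collapse_runs_of is a hypothetical wrapper name added here for illustration.

```cpp
#include <algorithm>
#include <string>

// Predicate from the comment: true when both characters equal Remove.
template <char Remove>
bool BothAre(char lhs, char rhs) { return lhs == rhs && lhs == Remove; }

// Hypothetical wrapper: collapses runs of the character given as the
// template argument and leaves every other character untouched.
template <char Remove>
std::string collapse_runs_of(std::string str) {
    str.erase(std::unique(str.begin(), str.end(), BothAre<Remove>), str.end());
    return str;
}
```

For example, collapse_runs_of<' '>(text) collapses spaces, while collapse_runs_of<'-'>(text) collapses runs of hyphens and leaves spaces alone.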


paul23 · Accepted Answer · 2011-12-02 20:50:50Z

So, I tried an approach with std::remove_if and a lambda expression. Although to my eyes it is still easier to follow than the code above, it doesn't have that "wow, neat, didn't realize you could do that" quality. I'm posting it anyway, if only for learning purposes:

bool prev(false);   // true when the previous character was rem
char rem(' ');      // the character whose runs are collapsed
auto iter = std::remove_if(str.begin(), str.end(), [&] (char c) -> bool {
    if (c == rem && prev) {
        return true;    // repeated rem: remove it
    }
    prev = (c == rem);
    return false;
});
str.erase(iter, str.end());

EDIT: realized that std::remove_if returns an iterator which can be used; removed unnecessary code.
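A self-contained version of this approach might look like the sketch below. The function name collapse_with_remove_if is hypothetical, and the sketch assumes the predicate is called once per character from front to back, which the stateful lambda relies on.

```cpp
#include <algorithm>
#include <string>

// Hypothetical wrapper around the remove_if approach from this answer.
// Assumption: the predicate is invoked in order, front to back.
std::string collapse_with_remove_if(std::string str, char rem = ' ') {
    bool prev = false;  // was the previous character equal to rem?
    auto iter = std::remove_if(str.begin(), str.end(), [&](char c) {
        if (c == rem && prev) {
            return true;  // second or later rem in a run: remove it
        }
        prev = (c == rem);
        return false;
    });
    str.erase(iter, str.end());
    return str;
}
```

Because the state lives in prev rather than in a comparison of neighbors, the same lambda shape could be adapted to other "drop repeats of X" rules.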

tlaxcala · Accepted Answer · 2016-09-03 16:42:47Z

4

A variant of Benjamin Lindley's answer that uses a lambda expression to make things cleaner:

std::string::iterator new_end =
    std::unique(str.begin(), str.end(),
                [=](char lhs, char rhs){ return (lhs == rhs) && (lhs == ' '); });
str.erase(new_end, str.end());

answered Sep 3, 2016 at 16:42

tlaxcala


1 Comment

Liroo Pierre Over a year ago

Capturing with [=] is unnecessary here since nothing is captured; just use [].

davidicus · Accepted Answer · 2015-06-26 19:09:36Z

2

Why not use a regular expression?

// regex_replace returns the new string, so assign the result back.
str = boost::regex_replace(str, boost::regex(" {2,}"), " ");

edited Jun 26, 2015 at 19:09

davidicus


answered Feb 21, 2015 at 1:06

Mike Jiang


4 Comments

davidicus Over a year ago

C++11 also includes the regex library, which can be used if using boost is an issue.

Mark Beckwith Over a year ago

There's no good reason, other than it's just more fun to do algorithm jujitsu.

Owolabi Ezekiel Tobiloba Over a year ago

Boost isn't part of the standard library, is it?

Anshuman Kumar Over a year ago

@OwolabiEzekielTobiloba No, it is not.
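Following the comment about the C++11 regex library, a Boost-free version could be sketched like this. The function name collapse_with_regex is made up for the example; the pattern " {2,}" matches two or more consecutive spaces.

```cpp
#include <regex>
#include <string>

// Hypothetical helper using the standard <regex> header instead of Boost.
std::string collapse_with_regex(const std::string& str) {
    // Two or more consecutive space characters.
    static const std::regex two_or_more_spaces(" {2,}");
    // std::regex_replace returns a new string; the input is not modified.
    return std::regex_replace(str, two_or_more_spaces, " ");
}
```

Since the result is returned rather than written in place, the caller would typically write str = collapse_with_regex(str);.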

Aurasphere · Accepted Answer · 2017-10-05 12:58:37Z

1

How about isspace(lhs) && isspace(rhs) to handle all types of whitespace?

edited Oct 5, 2017 at 12:58

Aurasphere


answered Oct 5, 2017 at 11:54

Patrick Neary


1 Comment

Vijayanath Viswanathan Over a year ago

Please answer only after trying it yourself, and include working sample code.
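A working sample of the isspace idea might look like the following sketch. The names BothAreWhitespace and collapse_whitespace_runs are hypothetical. Note one assumption about the behavior: std::unique keeps the first character of each run, so a run that starts with a tab leaves a tab behind, not a space.

```cpp
#include <algorithm>
#include <cctype>
#include <string>

// Hypothetical predicate: true when both characters are whitespace.
// The cast to unsigned char avoids undefined behavior for negative char values.
bool BothAreWhitespace(char lhs, char rhs) {
    return std::isspace(static_cast<unsigned char>(lhs)) &&
           std::isspace(static_cast<unsigned char>(rhs));
}

// Hypothetical wrapper: collapses each run of mixed whitespace down to
// the first whitespace character in that run.
std::string collapse_whitespace_runs(std::string str) {
    str.erase(std::unique(str.begin(), str.end(), BothAreWhitespace), str.end());
    return str;
}
```

If the goal is to always end up with a literal space, a follow-up pass that replaces the remaining whitespace characters with ' ' would be needed.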
