How to convert a string with regex

How to convert a string with regex - javascript

How can i convert a string with regex, so that it contains just alphabetical (a-z) or a hyphen.
It should get rid " ' ! ? . and so on. Even if they appear multiple times.
// if i have e.g.
var test = '"test!!!"';
// how can i get the value "test"?
Can sombody help. RegEx is totaly new to me.

Simply replace the characters you don't want:
'"test!!!"'.replace(/[^a-z-]/gi, '')
[^a-z-] matches all characters but a-z and the hyphen. The /g flag makes the regexp apply multiple times. The /i flag (optional) makes it match case-insensitively, i.e. not replace upper-case characters.

That's quite simple: You build a character class that matches everything except those chars you want and remove them by replacing each occurence (global flag) with the empty string:
return str.replace(/[^a-z-]/g, "");

str = "hello!! my + name $ is slim-shady";
console.log(str.replace(/[^a-z-]+/g, ''));
$ node src/java/regex/alphanum.js
hellomynameisslim-shady

Use the replace method for any string variable and specify the characters you wish to remove.
Here's an example:
var sampleString = ("Hello World!!"); //Sample of what you have.
var holdData = sampleString.replace(/!!/gi, '');
window.alert(holdData);

var str = "test!!!";
str = str.replace(/[^A-Za-z\-]/g,"");

Related

regex replace only a part of the match

I want to replace only a part of the string of a regex pattern match. I found this answer but I don't get it...
How do I use substitution?
Example of what I want: keep the first slug digit, only replace others
/09/small_image/09x/ > /09/thumbnail/
1st: unknown digit
2nd: "small_image"
3rd: unknown digit + "x"
Here is what I have so far:
var regexPattern = /\/\d\/small\_image\/\d*x/;
var regexPattern = /\/\d\/(small\_image\/\d*x)$1/; ??
var result = regexPattern.test(str);
if (result) {
str = str.replace(regexPattern, 'thumbnail');
}

var input = "/09/small_image/09x/";
var output = input.replace(/(\/\d+\/)small_image\/\d*x/, "$1thumbnail");
console.log(output);
Explanation:
Put the part you want to keep in parentheses, then refer to that as $1 in the replacement string - don't put $1 in your regex. So (\/\d+\/) means to match a forward slash followed by one or more digits, followed by another forward slash.
(Note that you don't need to escape underscores in a regex.)

Go with
var regexPattern = /(\/\d+\/)small\_image\/\d*x/;
and
str = str.replace(regexPattern, '$1thumbnail');
First, you were missing the +. Because 09 are two digits, you need the regexp to match one or more digits (\ḑ would be exactly one). This is accomplished by \d+
Second, everything you match is being removed at first. To get the /09/ part back afterwards, you have to remember it by putting it into brackets in the regexp (...) and afterwards reference it in the replacement via $1
One could as well create other groups and reference them by $2,$3 ...

How to remove everything up until the first character in Javascript?

I'm getting a string which returns an integer first, followed by a series of spaces and then the string I want. My goal is to just get "Moonwalking with Einstein" in this example. I'm using Javascript. What's the best way to achieve this?
2
Moonwalking with Einstein

You can use a str replace with a regex like this:
^[^A-Za-z]+
or
/^[^a-z]+/i
Working demo
var str = ' 2 \n\n\n\n\n\n\n\n Moonwalking with Einstein';
var result = str.replace(/^[^A-Za-z]+/, '');
The idea is to match whatever starts with a character that is not A-Z and a-z and replace it with an empty string and to keep what you want.

You can use JavaScript's replace function for this.
It looks like you are looking to replace digits and instances of two or more consecutive spaces
\d - Matches any digit
| - Or conditional symbol
\s - Matches any whitespace character
{2,} - Matches two or more
/g - Matches all instances
var string = " 2 Moonwalking with Einstein"
string = string.replace(/\d|\s{2,}/g, '')
var result = document.getElementById("result");
result.innerHTML = string;
<p id="result"></p>

Javascript reg exp not right

Here is a string str = '.js("aaa").js("bbb").js("ccc")', I want to write a regular expression to return an Array like this:
[aaa, bbb, ccc];
My regular expression is:
var jsReg = /.js\(['"](.*)['"]\)/g;
var jsAssets = [];
var js;
while ((js = jsReg.exec(find)) !== null) {
jsAssets.push(js[1]);
}
But the jsAssets result is
[""aaa").js("bbb").js("ccc""]
What's wrong with this regular expression?

Use the lazy version of .*:
/\.js\(['"](.*?)['"]\)/g
^
And it would be better if you escape the first dot.
This will match the least number of characters until the next quote.
jsfiddle demo
If you want to allow escaped quotes, use something like this:
/\.js\(['"]((?:\\['"]|[^"])+)['"]\)/g
regex101 demo

I believe it can be done in one-liner with replace and match method calls:
var str = '.js("aaa").js("bbb").js("ccc")';
str.replace(/[^(]*\("([^"]*)"\)[^(]*/g, '$1,').match(/[^,]+/g);
//=> ["aaa", "bbb", "ccc"]

The problem is that you are using .*. That will match any character. You'll have to be a bit more specific with what you are trying to capture.
If it will only ever be word characters you could use \w which matches any word character. This includes [a-zA-Z0-9_]: uppercase, lowercase, numbers and an underscore.
So your regex would look something like this :
var jsReg = /js\(['"](\w*)['"]\)/g;

In
/.js\(['"](.*)['"]\)/g
matches as much as possible, and does not capture group 1, so it matches
"aaa").js("bbb").js("ccc"
but given your example input.
Try
/\.js\(('(?:[^\\']|\\.)*'|"(?:[\\"]|\\.)*"))\)/
To break this down,
\. matches a literal dot
\.js\( matches the literal string ".js("
( starts to capture the string.
[^\\']|\\. matches a character other than quote or backslash or an escaped non-line terminator.
(?:[\\']|\\.)* matches the body of a string
'(?:[\\']|\\.)*' matches a single quoted string
(...|...) captures a single quoted or double quoted string
)\) closes the capturing group and matches a literal close parenthesis
The second major problem is your loop.
You're doing a global match repeatedly which makes no sense.
Get rid of the g modifier, and then things should work better.

Try this one - http://jsfiddle.net/UDYAq/
var str = new String('.js("aaa").js("bbb").js("ccc")');
var regex = /\.js\(\"(.*?)\"\){1,}/gi;
var result = [];
result = str.match (regex);
for (i in result) {
result[i] = result[i].match(/\"(.*?)\"/i)[1];
}
console.log (result);

To be sure that matched characters are surrounded by the same quotes:
/\.js\((['"])(.*?)\1\)/g

Why isn't this regex matching the expected way?

I'm trying to get rid of the slash character in case it exists at the end of my string. I used the following expression, intending to match any character not being slash at the end of the line.
var str = "http://hazaa.com/blopp/";
str.match("[^/$]+", "g");
For some reason (surely logical and explainable but not graspabled to me on my own), I get the split into three string looking as follows.
["http:", "hazaa.com", "blopp"]
What am I assuming wrongly?
How to resolve it?

In str.match("[^/$]+", "g");, why put dollar sign inside bracket? It's supposed to be outside, namely, str.match("[^/]+$", "g");.
To remove all the trailing slash, you can use str.replace(/\/+$/, ""). (If you'd like to remove the last trailing slash ONLY, remove the + in the replace's regex)
Update:
One more way that doesn't use replace:
function stripEndingSlashes(str) {
var matched = str.match("(.*[^/]+)/*$");
return matched ? matched[1] : "";
}

The regexp is choosing "everything except slash". That is why match() returns the parts of the string between slashes.
You can resolve it with the replace() function:
var str = "http://hazaa.com/blopp/";
//replace the last slash with an empty string to remove it
str = str.replace(/\/$/,'');
The regexp literal should always be surrounded between / characters. So here the regexp is:
\/ : this means a single slash character. In order to prevent Javascript from interpreting your slash as the end of regexp, it needs to be 'escaped' with a backslash.
$ : this means the end of the string

Your current regex will match the portion of string until the first / or $ is encountered. The second parameter is ignored; there is no second parameter for String.match.
To remove the trailing slash, use the String.replace function:
var str = "http://hazaa.com/blopp/";
str = str.replace(/\/$/, "");
console.log(str);
// "http://hazaa.com/blopp"
If you need to check whether a string ends with a slash, use the String.match method like this:
var str = "http://hazaa.com/blopp/";
var match = str.match(/\/$/);
console.log(match);
// null if string does not end with /
// ["/"] if string ends with a /
If you need to grab every thing except the last character(s) being /, use this:
var r = /(.+?)\/*$/;
console.log("http://hazaa.com/blopp//".match(r)); // ["http://hazaa.com/blopp//", "http://hazaa.com/blopp"]
console.log("http://hazaa.com/blopp/".match(r)); // ["http://hazaa.com/blopp/", "http://hazaa.com/blopp"]
console.log("http://hazaa.com/bloppA".match(r)); // ["http://hazaa.com/bloppA", "http://hazaa.com/bloppA"]
The 2nd index in the returned array contains the desired portion of the URL. The regex works as follows:
(.+?) un-greedy match (and capture) any character
\/*$ matches optional trailing slash(es)
The first portion regex is intentionally changed to un-greedy. If it was greedy, it would attempt to find the biggest match as long the the whole regex matches (consuming the trailing / in the process). When ungreedy, it will find the smallest match as long as the whole regex matches.

Why use regex? Just check if the last symbol of the string is a slash and then slice. Like this:
if (str.slice(-1) === '/') {
str = str.slice(0, -1);
}

jQuery remove special characters from string and more

I have a string like this:
var str = "I'm a very^ we!rd* Str!ng.";
What I would like to do is removing all special characters from the above string and replace spaces and in case they are being typed, underscores, with a - character.
The above string would look like this after the "transformation":
var str = 'im-a-very-werd-strng';

replace(/[^a-z0-9\s]/gi, '') will filter the string down to just alphanumeric values and replace(/[_\s]/g, '-') will replace underscores and spaces with hyphens:
str.replace(/[^a-z0-9\s]/gi, '').replace(/[_\s]/g, '-')
Source for Regex: RegEx for Javascript to allow only alphanumeric
Here is a demo: http://jsfiddle.net/vNfrk/

Assuming by "special" you mean non-word characters, then that is pretty easy.
str = str.replace(/[_\W]+/g, "-")

str.toLowerCase().replace(/[\*\^\'\!]/g, '').split(' ').join('-')

Remove numbers, underscore, white-spaces and special characters from the string sentence.
str.replace(/[0-9`~!##$%^&*()_|+\-=?;:'",.<>\{\}\[\]\\\/]/gi,'');
Demo

this will remove all the special character
str.replace(/[_\W]+/g, "");
this is really helpful and solve my issue. Please run the below code and ensure it works
var str="hello world !#to&you%*()";
console.log(str.replace(/[_\W]+/g, ""));

Since I can't comment on Jasper's answer, I'd like to point out a small bug in his solution:
str.replace(/[^a-z0-9\s]/gi, '').replace(/[_\s]/g, '-');
The problem is that first code removes all the hyphens and then tries to replace them :)
You should reverse the replace calls and also add hyphen to second replace regex. Like this:
str.replace(/[_\s]/g, '-').replace(/[^a-z0-9-\s]/gi, '');

Remove/Replace all special chars in Jquery :
If
str = My name is "Ghanshyam" and from "java" background
and want to remove all special chars (") then use this
str=str.replace(/"/g,' ')
result:
My name is Ghanshyam and from java background
Where g means Global

var str = "I'm a very^ we!rd* Str!ng.";
$('body').html(str.replace(/[^a-z0-9\s]/gi, " ").replace(/^\s+|\s+$|\s+(?=\s)/g, "").replace(/[_\s]/g, "-").toLowerCase());
First regex remove special characters with spaces than remove extra spaces from string and the last regex replace space with "-"

We Keep Coding

JavaScript is the programming language of the Web.

How to convert a string with regex - javascript

How can i convert a string with regex, so that it contains just alphabetical (a-z) or a hyphen. It should get rid " ' ! ? . and so on. Even if they appear multiple times. // if i have e.g. var test = '"test!!!"'; // how can i get the value "test"? Can sombody help. RegEx is totaly new to me.

Simply replace the characters you don't want: '"test!!!"'.replace(/[^a-z-]/gi, '') [^a-z-] matches all characters but a-z and the hyphen. The /g flag makes the regexp apply multiple times. The /i flag (optional) makes it match case-insensitively, i.e. not replace upper-case characters.

That's quite simple: You build a character class that matches everything except those chars you want and remove them by replacing each occurence (global flag) with the empty string: return str.replace(/[^a-z-]/g, "");

str = "hello!! my + name $ is slim-shady"; console.log(str.replace(/[^a-z-]+/g, '')); $ node src/java/regex/alphanum.js hellomynameisslim-shady

Use the replace method for any string variable and specify the characters you wish to remove. Here's an example: var sampleString = ("Hello World!!"); //Sample of what you have. var holdData = sampleString.replace(/!!/gi, ''); window.alert(holdData);

var str = "test!!!"; str = str.replace(/[^A-Za-z\-]/g,"");

Related

regex replace only a part of the match

How to remove everything up until the first character in Javascript?

Javascript reg exp not right

Why isn't this regex matching the expected way?

jQuery remove special characters from string and more

Categories

Resources