Match everything between certain characters

Match everything between certain characters - javascript

Wow, I suck at regex
http://regex101.com/r/lM8oX3
([*][.]+[*])
I'm trying to match text such as this:
*hello*

Just try with following regex:
(\*[^*]+\*)
In your regex you have [.] which in fact searches for dots because in [] it loses its special context and is treated as a normal character. You should better use .+ then but it will match also * characters. So use my above solution then.
Live demo

This will capture
var text = "asdfasdf *hello*";
console.log( text.match(/([*][^*]+[*])/)[1]);
But that only grabs the first match;
If you want all matches
var text = "asdfasdf *hello* asdffdsa *asdf*";
var matches = text.match(/([*][^*]+[*])/g);
if(matches.length > 1) {
for(var i=1; i<matches.length; i++) {
console.log(matches[i]);
}
}

Related

Regex match specific word and word with repeat lettres

in js (nodejs) i try to catch a specific word and this same word write with repeat letter ...
exemple, if i say world, i want catch this:
world woorld wwooorld worllddd worldddd
so i can juste make a for on my world an create a dynamique regexw with something like this:
var w = 'world';
var regex = '';
for(var i = 0; i < w.lengh; i++){
regex += `${w[i]}+`;
}
but can i do this only with one unique regex ?? ( or other original idea )

Your idea of constructing the regex is correct. You need however to take care that characters with a special meaning in regular expressions do not make things go wrong. So you could escape non-alphanumerical characters with a backslash:
var w = "world+hello";
var regex = new RegExp('^' + w.replace(/\W/g, '\\$&+').replace(/\w/g, '$&+') + '$');
console.log(regex, regex.test('wwoooorllldd++++hhelllo'));

The following pattern would match any combination of letter repeats for "world":
"w+o+r+l+d+"
The + denotes capturing 1 or more of the previous character.

Retrieving several capturing groups recursively with RegExp

I have a string with this format:
#someID#tn#company#somethingNew#classing#somethingElse#With
There might be unlimited #-separated words, but definitely the whole string begins with #
I have written the following regexp, though it matches it, but I cannot get each #-separated word, and what I get is the last recursion and the first (as well as the whole string). How can I get an array of every word in an element separately?
(?:^\#\w*)(?:(\#\w*)+) //I know I have ruled out second capturing group with ?: , though doesn't make much difference.
And here is my Javascript code:
var reg = /(?:^\#\w*)(?:(\#\w*)+)/g;
var x = null;
while(x = reg.exec("#someID#tn#company#somethingNew#classing#somethingElse#With"))
{
console.log(x);
}
And here is the result (Firebug, console):
["#someID#tn#company#somet...sing#somethingElse#With", "#With"]
0
"#someID#tn#company#somet...sing#somethingElse#With"
1
"#With"
index
0
input
"#someID#tn#company#somet...sing#somethingElse#With"
EDIT :
I want an output like this with regular expression if possible:
["#someID", "#tn", #company", "#somethingNew", "#classing", "#somethingElse", "#With"]
NOTE that I want a RegExp solution. I know about String.split() and String operations.

You can use:
var s = '#someID#tn#company#somethingNew#classing#somethingElse#With'
if (s.substr(0, 1) == "#")
tok = s.substr(1).split('#');
//=> ["someID", "tn", "company", "somethingNew", "classing", "somethingElse", "With"]

You could try this regex also,
((?:#|#)\w+)
DEMO
Explanation:
() Capturing groups. Anything inside this capturing group would be captured.
(?:) It just matches the strings but won't capture anything.
#|# Literal # or # symbol.
\w+ Followed by one or more word characters.
OR
> "#someID#tn#company#somethingNew#classing#somethingElse#With".split(/\b(?=#|#)/g);
[ '#someID',
'#tn',
'#company',
'#somethingNew',
'#classing',
'#somethingElse',
'#With' ]

It will be easier without regExp:
var str = "#someID#tn#company#somethingNew#classing#somethingElse#With";
var strSplit = str.split("#");
for(var i = 1; i < strSplit.length; i++) {
strSplit[i] = "#" + strSplit[i];
}
console.log(strSplit);
// ["#someID", "#tn", "#company", "#somethingNew", "#classing", "#somethingElse", "#With"]

Get all words starting with X and ending with Y

I have got a textarea with keyup=validate()
I need a javascript function that gets all words starting with # and ending with a character that is not A-Za-z0-9
For example:
This is a text #user1 this is more text #user2. And this is even more #user3!
The function gives an array:
Array("#user1","#user2","#user3");
I am sure there must be a way to do this written on somewhere on the internet if I just google something but I have no idea what I have to look for.. I am very new with regular expresions.
Thank you very much!

The regular expression you want is:
/#[a-z\d]+/ig
This matches # followed by a sequence of letters and numbers. The i modifier makes it case-insensitive, so you don't have to put A-Z in the character class, and g makes it find all the matches.
var str = "This is a text #user1 this is more text #user2. And this is even more #user3!";
var matches = str.match(/#[a-z\d]+/ig);
console.log(matches);

JS
var str = "This is a text #user1 this is more text #user2. And this is even more #user3!",
var textArr = str.split(" ");
for(var i = 0; i < textArr.length; i++) {
var test = textArr[i];
matches = test.match(/^#.*.[A-Za-z0-9]$/);
console.log(matches);
};
Explanation:
You should also read about the regex(http://www.w3schools.com/jsref/jsref_obj_regexp.asp) and match(http://www.w3schools.com/jsref/jsref_match.asp) to get an idea how it works.
Basically, applying ^# means starting the regex look for #. $ means ending with. and .* any character in between.
To Test: http://www.regular-expressions.info/javascriptexample.html

Thanks for the replies above, they've helped me - Where I've written this method that hopefully answers the question about having a start and end regex check.
In this example it looks for ##_ at the start and _## at the end
e.g. ##_ anyTokenYouNeedToFind _##.
Code:
const tokenSearchHelper = (inputText) => {
let matches = inputText.match(/##_[a-zA-Z0-9_\d]+_##/ig);
return matches;
}
const out = tokenSearchHelper("Hello ##_World_##");
console.log(out);

How to get the characters preceded by "add_"

I have a strings "add_dinner", "add_meeting", "add_fuel_surcharge" and I want to get characters that are preceded by "add_" (dinner, meeting, fuel_surcharge).
[^a][^d]{2}[^_]\w+
I have tried this one, but it only works for "add_dinner"
[^add_]\w+
This one works for "add_fuel_surcharge", but takes "inner" from "add_dinner"
Help me to understand please.

Use capturing groups:
/^add_(\w+)$/
Check the returned array to see the result.

Since JavaScript doesn't support lookbehind assertions, you need to use a capturing group:
var myregexp = /add_(\w+)/;
var match = myregexp.exec(subject);
if (match != null) {
result = match[1];
}
[^add_] is a character class that matches a single character except a, d or _. When applied to add_dinner, the first character it matches is i, and \w+ then matches nner.

The [^...] construct matches any single character except the ones listed. So [^add_] matches any single character other than "a", "d" or "_".
If you want to retrieve the bit after the _ you can do this:
/add_(\w+_)/
Where the parentheses "capture" the part of the expression inside. So to get the actual text from a string:
var s = "add_meeting";
var result = s.match(/add_(\w+)/)[1];
This assumes the string will match such that you can directly get the second element in the returned array that will be the "meeting" part that matched (\w+).
If there's a possibility that you'll be testing a string that won't match you need to test that the result of match() is not null.
(Or, possibly easier to understand: result = "add_meeting".split("_")[1];)

You can filter _ string by JavaScript for loop ,
var str = ['add_dinner', 'add_meeting', 'add_fuel_surcharge'];
var filterString = [];
for(var i = 0; i < str.length; i ++){
if(str[i].indexOf("_")>-1){
filterString.push(str[i].substring(str[i].indexOf("_") + 1, str[i].length));
}
}
alert(filterString.join(", "));

Regex won't match words as expected

I am trying to use XRegExp to test if a string is a valid word according to these criteria:
The string begins with one or more Unicode letters, followed by
an apostrophe (') followed by one or more Unicode letters, repeated 0 or more times.
The string ends immediately after the matched pattern.
That is, it will match these terms
Hello can't Alah'u'u'v'oo O'reilly
but not these
eatin' 'sup 'til
I am trying this pattern,
^(\\p{L})+('(\\p{L})+)*$
but it won't match any words that contain apostrophes. What am I doing wrong?
EDIT: The code using the regex
var separateWords = function(text) {
var word = XRegExp("(\\p{L})+('(\\p{L})+)*$");
var splits = [];
for (var i = 0; i < text.length; i++) {
var item = text[i];
while (i + 1 < text.length && word.test(item + text[i + 1])) {
item += text[i + 1];
i++;
}
splits.push(item);
}
return splits;
};

I think you will need to omit the string start/end anchors to match single words:
"(\\p{L})+('(\\p{L})+)*"
Also I'm not sure what those capturing groups are needed for (that may depend on your application), but you could shorten them to
"\\p{L}+('\\p{L}+)*"

Try this regex:
^[^'](?:[\w']*[^'])?$
First it checks to ensure the first character is not an apostrophe. Then it either gets any number of word characters or apostrophes followed by anything other than an apostrophe, or it gets nothing (one-letter word).

We Keep Coding

JavaScript is the programming language of the Web.

Match everything between certain characters - javascript

Wow, I suck at regex http://regex101.com/r/lM8oX3 ([][.]+[]) I'm trying to match text such as this: hello

Just try with following regex: (\[^]+\) In your regex you have [.] which in fact searches for dots because in [] it loses its special context and is treated as a normal character. You should better use .+ then but it will match also characters. So use my above solution then. Live demo

Related

Regex match specific word and word with repeat lettres

Retrieving several capturing groups recursively with RegExp

Get all words starting with X and ending with Y

How to get the characters preceded by "add_"

Regex won't match words as expected

Categories

Resources

We Keep Coding

JavaScript is the programming language of the Web.

Match everything between certain characters - javascript

Wow, I suck at regex http://regex101.com/r/lM8oX3 ([*][.]+[*]) I'm trying to match text such as this: *hello*

Just try with following regex: (\*[^*]+\*) In your regex you have [.] which in fact searches for dots because in [] it loses its special context and is treated as a normal character. You should better use .+ then but it will match also * characters. So use my above solution then. Live demo

Related

Regex match specific word and word with repeat lettres

Retrieving several capturing groups recursively with RegExp

Get all words starting with X and ending with Y

How to get the characters preceded by "add_"

Regex won't match words as expected

Categories

Resources

Wow, I suck at regex http://regex101.com/r/lM8oX3 ([][.]+[]) I'm trying to match text such as this: hello

Just try with following regex: (\[^]+\) In your regex you have [.] which in fact searches for dots because in [] it loses its special context and is treated as a normal character. You should better use .+ then but it will match also characters. So use my above solution then. Live demo