Capturing groups in string using preg_match

Question

I got in trouble parsing a text file in codeigniter, for each line in file I need to capture groups data...the data are: - progressive number - operator - manufacturer - model - registration - type ...

Accepted Answer

You should not use w for capturing the data as some of the characters in your text like &, ., - and / are not part of word characters. Moreover some of them are space separated, so you should replace w{1,} with S+(?: S+)* which will capture your text properly into groups you have made.Try changing your regex to this and it should work,^s*(d+)s+(S+(?: S+)*)s+(S+(?: S+)*)s+(S+(?: S+)*)s+(S+(?: S+)*)s+(S+(?: S+)*)$Check this demoExplanation of what S+(?: S+)* does in above regex.S+ &#8211; S is opposite of s meaning it matches any non-whitespace (won&#8217;t match a space or tab or newline or vertical space or horizontal space and in general any whitespace) character. Hence S+ matches one or more visible characters(?: S+)* &#8211; Here ?: is only for turning a group as non-capture group and following it has a space and S+ and all of it is enclosed in parenthesis with * quantifier. So this means match a space followed by one or more non-whitespace character and whole of it zero or more times as * quantifier is used.So S+(?: S+) will match abc or abc xyz or abc pqr xyz and so on but the moment more than one space appears, the match stops as there is only a single space present in the regex before S+Hope my explanation is clear. If still any doubt, please feel free to ask.

Advertisement

Answer