Ok, I'm really pushing my luck but why not try?
I'm including a fair test sample of the various text formats I get. (The names, emails and phone numbers have been altered to protect identities.)
About 60% of the responses are like the first 4 examples.
The other 40% are formatted various ways.
Is there a way to 'fairly' reliably get the email address and the 5 digit zip code parsed out into variables?
I'd really appreciate any input.
Shaw Guyery
Newark, NJ 07103
shaxxxx@gmail.com
+1 732 621 0987
Authorized to work in the US for any employer Work Experience
Waitress/Food Service Associate
Ger Hoh
Brooklyn, NY 11203
gmaxxx@aol.com
+1 347 356 1987 Dear Hiring Manger,
I am seeking a position in yo
Jenn Hern
New York, NY 10033
jenniferhernandezxx@gmail.com
+1 911 757 7492
Competent housekeeper with over 4 years of experience in providing excellent housekeeping services in hotel and private residence settings. Capable of handling work and staff pressure in fast- paced environmen
Kat Kla
Wallington, NJ 07057
katrisxxxx@gmail.com
+1 201 702 3899
Work Experience
Housekeeper
errolalexxxx@gmail.com
+1 347 422 1789
To obtain a position that would provide me an opportunity to grow and utilize my skills. Authorized to work in the US for any employer
Work Experience
Scheduler
Bri Bet
4100 Central Ave Rochelle Park, NJ 07662 551-206-1345 Betancurxxx@gmail.com
Objective: Seeking an opportunity within an organization which will utilize my strengths and skills while providing opportunity for professional growth
Summary of Skills
I'm not sure how this needs to fit into the whole flow but, as always, the regexes can be simpler if you use them from a scripting language that has a split function.
In JS, for example, you might start by trying this kind of thing:
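A minimal sketch of that split-then-match idea (not the code originally posted; the function name `extractContact` and both patterns are illustrative assumptions). It splits one copied block into lines, then takes the first email-like string and the first 5-digit ZIP that follows a two-letter state code:

```javascript
// Sketch: pull the first email address and 5-digit ZIP out of one copied block.
// `extractContact` and its return shape are illustrative names, not from the thread.
function extractContact(text) {
    // Split on line breaks and drop blank lines, so each regex sees one line.
    const lines = text.split(/\r?\n/).map(s => s.trim()).filter(Boolean);

    // Deliberately loose email pattern: name, @, then a domain containing a dot.
    const emailRe = /[^\s@:]+@[^\s@]+\.[A-Za-z]{2,}/;
    // Two-letter state code followed by a 5-digit ZIP, e.g. "NJ 07103".
    const zipRe = /\b[A-Z]{2}\s+(\d{5})\b/;

    let email = "";
    let zip = "";
    for (const line of lines) {
        if (!email) {
            const m = line.match(emailRe);
            if (m) email = m[0];
        }
        if (!zip) {
            const m = line.match(zipRe);
            if (m) zip = m[1];
        }
    }
    // Empty strings mean "not found", e.g. a block with no address line.
    return { email, zip };
}

const sample = [
    "Shaw Guyery",
    "Newark, NJ 07103",
    "shaxxxx@gmail.com",
    "+1 732 621 0987"
].join("\n");

console.log(extractContact(sample));
// → { email: 'shaxxxx@gmail.com', zip: '07103' }
```

Because each pattern is tried line by line, it also copes with the format where address, phone and email share a single line. A block with no address simply returns an empty `zip`.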
Hey man, I am not able to test until tomorrow. I wanted to let you know that each one of those ‘paragraphs’ is a different example of the text I will ‘scrape’.
I don’t need to process that whole block at once, only one paragraph at a time.
FYI - I’m automating getting that web data into my FileMaker Pro database: setting variables in KM, then setting AppleScript variables, then picking them ‘up’ in FileMaker.
Hi @ComplexPoint, I apologize for not being clear about what I need. I should have done better.
I am very appreciative of your time and expertise, really.
I used the macro and it works even for a single paragraph, so I'm good. The one thing: I don't know how to parse out the 'fullName', 'email' and 'zip' from a single result in the %Variable%candidatesJSON%
I tried looking at a previous macro you provided "Name and number from penultimate two lines" and mimicking that but to no avail.
A single input is text that I have to highlight and copy on a web page that is embedded. It is not obtainable by a select all / copy all command, so yes, it is manually copied by me.
Any of the following blocks could be a single input: there are 6 different examples in the following text.
Shaw Guyery
Newark, NJ 07103
shaxxxx@gmail.com
+1 732 621 0987
Authorized to work in the US for any employer Work Experience
Waitress/Food Service Associate
Ger Hoh
Brooklyn, NY 11203
gmaxxx@aol.com
+1 347 356 1987 Dear Hiring Manger,
I am seeking a position in yo
Jenn Hern
New York, NY 10033
jenniferhernandezxx@gmail.com
+1 911 757 7492
Competent housekeeper with over 4 years of experience in providing excellent housekeeping services in hotel and private residence settings. Capable of handling work and staff pressure in fast- paced environmen
Kat Kla
Wallington, NJ 07057
katrisxxxx@gmail.com
+1 201 702 3899
Work Experience
Housekeeper
errolalexxxx@gmail.com
+1 347 422 1789
To obtain a position that would provide me an opportunity to grow and utilize my skills. Authorized to work in the US for any employer
Work Experience
Scheduler
Bri Bet
4100 Central Ave Rochelle Park, NJ 07662 551-206-1345 Betancurxxx@gmail.com
Objective: Seeking an opportunity within an organization which will utilize my strengths and skills while providing opportunity for professional growth
Summary of Skills
But on a single block of text (one candidate), I get the following response and don't know how to get it parsed out more cleanly, without the brackets, etc.
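For a result of that kind, a minimal sketch of one way to get plain values out. It assumes the variable holds a JSON array of records with the keys 'fullName', 'email' and 'zip' mentioned in this thread; the exact shape of the macro's output, and the function names `firstCandidateFields` and `asLines`, are assumptions:

```javascript
// Sketch: take the first record from a JSON array and return plain strings.
// Assumed input shape: [{"fullName": "...", "email": "...", "zip": "..."}]
function firstCandidateFields(jsonText) {
    const parsed = JSON.parse(jsonText);
    // Accept either an array of records or a single record object.
    const record = Array.isArray(parsed) ? parsed[0] : parsed;
    if (!record) return { fullName: "", email: "", zip: "" };
    // Missing keys fall back to empty strings.
    const { fullName = "", email = "", zip = "" } = record;
    return { fullName: String(fullName), email: String(email), zip: String(zip) };
}

// One field per line, with no brackets or quotes, is easy to split
// back into separate variables in a later step.
function asLines(fields) {
    return [fields.fullName, fields.email, fields.zip].join("\n");
}

const candidatesJSON =
    '[{"fullName":"Shaw Guyery","email":"shaxxxx@gmail.com","zip":"07103"}]';
console.log(asLines(firstCandidateFields(candidatesJSON)));
// Prints three lines: Shaw Guyery / shaxxxx@gmail.com / 07103
```

Returning one value per line keeps the later steps simple: each variable can be set from a single line of the result.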
erg!!! I am telling you =), I tried that exact code in the last step (obviously not because it didn't work) - I looked at one of your previous macros and mimicked it..... and no workee.....
Hi @ComplexPoint, I'm getting some fails on the 'data scrape' for certain candidates.
I've included some examples of the failed data and the successful data:
Fails:
Troy Krueger
601 West 57th Street, Apt. 14H
New York, New York 10019
Phone: 1-646-220-1111
E-mail: troykrueger.guerra30@yahoo.com
Fails:
Troy Krueger
Ria Batson Resume
Brooklyn, NY 11212
troykrueger.batson@gmail.com
+1 929 225 1111
Fails:
Troy Krueger
Mount Vernon, NY
troykrueger1975@icloud.com
+1 914 573 1111
WORKS:
Troy Krueger
New York, NY 11369
troykruegerl59@gmail.com
+1 347 237 1111
WORKS:
Troy Di Krueger
Bookseller
New Rochelle, NY 10805
troykrueger@gmail.com
+1 908 601 1111
I truly appreciate your help, no worries if I have 'over extended' my stay!
cheers
Yes it is a correct zip code....
I appreciate your response....
I think I'll have to do a workaround.... the data is just not formatted in a consistent way, which is making it very difficult to automate 'scraping' it.
Through the help of these awesome folks on the forum I have gotten it much more automated than it was at the beginning, when I was using a bunch of copies and pastes....
all good, cheers
As the pattern matching only needs to find a line number, and the ZIP patterns are not that regular (they don't, for example, necessarily involve 5 digits at all), you might be able to relax the patterns to something like:
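A minimal sketch of a relaxed pattern along those lines (not the pattern originally posted; `locationRe` and `findLocationLine` are illustrative names). It assumes the location sits on its own line as "City, State" with the ZIP optional, and returns the index of that line so other fields can be read relative to it:

```javascript
// Relaxed "City, State [ZIP]" pattern. The line must not start with a digit
// (which skips street addresses), and the ZIP group is optional.
const locationRe = /^[^,\d][^,]*,\s*[A-Za-z][A-Za-z .]*(?:\s+(\d{5})(?:-\d{4})?)?$/;

// Sketch: find the index of the first line that looks like a location.
function findLocationLine(text) {
    const lines = text.split(/\r?\n/).map(s => s.trim()).filter(Boolean);
    const lineIndex = lines.findIndex(line => locationRe.test(line));
    if (lineIndex === -1) return { lineIndex, location: "", zip: "" };
    const m = lines[lineIndex].match(locationRe);
    // m[1] is undefined when the line has no ZIP, e.g. "Mount Vernon, NY".
    return { lineIndex, location: lines[lineIndex], zip: m[1] || "" };
}

const noZip = [
    "Troy Krueger",
    "Mount Vernon, NY",
    "troykrueger1975@icloud.com",
    "+1 914 573 1111"
].join("\n");

console.log(findLocationLine(noZip));
// → { lineIndex: 1, location: 'Mount Vernon, NY', zip: '' }
```

Under these assumptions the three failing samples above resolve to a location line: index 2 for the block with a street address, index 2 for the block with the "Resume" line, and index 1 for the block with no ZIP. The format that puts address, phone and email on one line starts with a digit, so this pattern skips it and it would need separate handling.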